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WO 00/44906 PCT/USOO/02200 
GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION IN 

ESCHERICHIA CQU 

BACKGROUND OF THE INVENTION 

Since the discovery of penicillin, the use of antibiotics to treat the ravages of bacterial infections has saved 

5 mfllions of Gves. With the advent of these "miracle drugs," for a time it was popularly believed that humanity might, once 

and for all, be saved from the scourge of bacterial infections. In fact, during the 1980s and early 1990s, many large 

pharmaceutical companies cut back or eliminated antibiotics research and development. They believed that infectious 

disease caused by bacteria finally had been conquered and that markets for new drugs were limited. Unfortunately, 

this belief was overly optimistic. 

10 The tide is beginning to turn in favor of the bacteria as reports of drug resistant bacteria become more frequent. 

The United States Centers for Disease Control announced that one of the most powerful known antibiotics, vancomycin, 
was unable to treat an infection of the common Staphylococcus aureus (staph). This organism is commonly found in our 
environment and is responsible for many nosocomial infections. The import of this announcement becomes clear when 
one considers that vancomycin was used for years to treat infections caused by stubborn strains of bacteria, like 

15 staph, in short, the bacteria are becoming resistant to our most powerful antibiotics. If this trend continues, it is 
conceivable that we will return to a time when what are presently considered minor bacterial infections are fatal 
diseases. 

There are a number of causes for the predicament in which practitioners of medical arts find themselves. Over- 
prescription and improper prescription habits by some physicians have caused an indiscriminate increase in the availability 
20 of antibiotics to the public. The patent is also partly responsible, for even in instances where an antibiotic is the 
appropriate treatment, patients will often improperly use the drug, the result being yet another population of bacteria that 
is resistant, in whole or in part, to traditional antibiotics. 

The bacterial scourges that have haunted humanity remain, in spite of the development of modem scientific 
practices to deal with the diseases that they cause. Drug resistant bacteria are now advancing on the health of humanity. 
25 A new generation of antibiotics to once again deal with the pending health threat that bacteria present is required. 

Discovery of New Antibiotics 
As more and more bacterial strains become resistant to the panel of available antibiotics, new compounds 
are required. In the past, practitioners of pharmacology would have to rely upon traditional methods of drug discovery 
to generate novel, safe and efficacious compounds for the treatment of disease. Traditional drug discovery methods 
30 involve blindly testing potential drug candidate-molecules, often selected at random, in the hope that one might prove 
to be an effective treatment for some disease. The process is painstaking and laborious, with no guarantee of 
success. Today, the average cost to discover and develop a new drug is nearly US $500 million, and the average time 
is 15 years from laboratory to patient Improving this process, even incrementally, would represent a huge advance in 
the generation of novel antimicrobial agents. 
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Newly emerging practices in drug discovery utilize a number of biochemical techniques to provide for 
directed approaches to creating new drugs, rather than discovering them at random. For example, gene sequences and 
proteins encoded thereby that are required for the proliferation of an organism make for excellent targets since 
exposure of bacteria to compounds active against these targets would result in the inactivation of the organism. Once 
5 a target is identified, biochemical analysis of that target can be used to discover or to design molecules that interact 

with and alter the functions of the target. Using physical and computational techniques, to analyze structural and 
biochemical targets in order to derive compounds that interact with a target is called rational drug design and offers 
great future potential. Thus, emerging drug discovery practices use molecular modeling techniques, combinatorial 
chemistry approaches, and other means to produce and screen and/or design large numbers of candidate compounds. 

10 Nevertheless, while this approach to drug discovery is clearly the way of the future, problems remain. For 

example, the initial step of identifying molecular targets for investigation can be an extremely time consuming task. It 
may also be difficult to design molecules that interact with the target by using computer modeling techniques. 
Furthermore, in cases where the function of the target is not known or is poorly understood, it may be difficult to 
design assays to detect molecules that interact with and alter the functions of the target. To improve the rate of 

15 novel drug discovery and development, methods of identifying important molecular targets in pathogenic 

microorganisms and methods for identifying molecules that interact with and alter the functions of such molecular 
targets are urgently required. 

Escherichia coli represents an excellent model system to understand bacterial biochemistry and physiology. 
The estimated 4288 genes scattered along the 4.6 x 10 s base pairs of the Escherichia coli\jE. coii\ chromosome offer 

20 tremendous promise for the understanding of bacterial biochemical processes. In turn, this knowledge will assist in 
the development of new tools for the diagnosis and treatment of bacteria-caused human disease. The entire £ coff 
genome has been sequenced, and this body of information holds a tremendous potential for application to the 
discovery and development of new antibiotic compounds. Yet, in spite of this accomplishment, the general functions 
or roles of many of these genes are still unknown. For example, the total number of proliferation-required genes 

25 contained within the E. coli genome is unknown, but has been variously estimated at around 200 to 700 (Armstrong, 

K. A. and Fan, D.P. Essential Genes in the metB-malB Region of Escherichia coli K1 2, 1 975, J. Bacterid. 1 26: 48-55). 

Novel, safe and effective antimicrobial compounds are needed in view of the rapid rise of antibiotic resistant 
microorganisms. However, prior to this invention, the characterization of even a single bacterial gene was a 
painstaking process, requiring years of effort. Accordingly, there is an urgent need for more novel methods to identify 

30 and characterize bacterial genomic sequences that encode gene products required for proliferation and for methods to 

identify molecules that interact with and alter the functions of such genes and gene products. 

SUMMARY OF THE INVENTION 
One embodiment of the present invention is a purified or isolated nucleic acid sequence consisting essentially 
of one of SEQ ID NOs: 1-81, 405-485, wherein said nucleic acid inhibits microorganism proliferation. The nucleic acid 

35 sequence may be complementary to at least a portion of a coding sequence of a gene whose expression is required for 
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microorganism proliferation. The nucleic acid sequence may comprise a fragment of one of SEQ ID NOs. 1-81, 405- 
485, said fragment selected from the group consisting of fragments comprising at least 10, at least 20, at least 25, 
at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 181, 405485. The nucleic acid 
sequence may be complementary to a coding sequence of a gene whose expression is required for microorganism 
5 proliferation. 

Another embodiment of the present invention is a vector comprising a promoter operably linked to a nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1-81, 405485. The promoter may be 
active in an organism selected from the group consisting of Escherichia coli f Staphylococcus aureus, Pseudomonas 
aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis, Streptococcus 

10 pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, 
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella typhi. Salmonella paratyphi, 
Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, Treponema 
pallidum. Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia trachomatus. 
Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

1 5 Another embodiment of the present invention is a host cell containing the vectors described above. 

Another embodiment of the present invention is a purified or isolated nucleic acid consisting essentially of 
the coding sequence of one of SEQ ID NOs: 82 88, 90-242. One aspect of this embodiment is a fragment of the 
nucleic acid comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive 
bases of one of SEQ ID NOs: 82-88, 90-242. 

20 Another embodiment of the present invention isa vector comprising a promoter operably finked to the nucleic 

acids of the preceding embodiment. 

Another aspect of the present invention is a purified or isolated nucleic acid comprising a nucleic acid 
sequence complementary to at least a portion of an intragenic sequence, intergenic sequence, sequences spanning at 
least a portion of two or more genes, 5' noncoding region, or 3' noncoding region within an operon encoding a 

25 polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs: 243-357, 359 398. 

Another embodiment of the present invention is a purified or isolated nucleic acid comprising a nucleic acid 
having at least 70% homology to a sequence selected from the group consisting of SEQ ID NOs 1-81, 405485, 82- 
88, 90-242 or the sequences complementary thereto as determined using BLASTN version 2.0 with the default 
parameters. The nucleic acid may be from an organism selected from the group consisting of Staphylococcus aureus, 

30 Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis. 

Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida 
albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella 
paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, 
Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, and Chlamydia 

35 trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above species. 
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Another embodiment of the present invention is a purified or isolated nucleic acid consisting essentially of a 
nucleic acid encoding a polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 
359-398. 

Another embodiment of the present invention is a vector comprising a promoter operably linked to a nucleic 
acid encoding a polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 359 398. 

Another embodiment of the present invention is a host cell containing the vector of the preceding 
embodiment. 

Another embodiment of the present invention is purified or isolated polypeptide comprising the sequence of 
one of SEQ ID NOs: 243-357, 359-398. 

Another embodiment of the present invention is purified or isolated polypeptide comprising a fragment of one 
of the polypeptides of SEQ ID NOs. 243-357, 359-398, said fragment selected from the group consisting of fragments 
comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the polypeptides of SEQ ID NOs.: 243-357, 359-398. 

Another embodiment of the present invention is an antibody capable of specifically binding the polypeptide of 
the preceding embodiment. 

Another embodiment of the present invention is method of producing a polypeptide, comprising introducing a 
vector comprising a promoter operably linked to a nucleic acid encoding a polypeptide having a sequence selected from 
the group consisting of SEQ ID NOs. 243-357, 359-398into a cell. The method may further comprise the step of 
isolating said protein. 

Another embodiment of the present invention is a method of inhibiting proliferation comprising inhibiting the 
activity or reducing the amount of a polypeptide having a sequence selected from the group consisting of SEQ ID NOs. 
243-357, 359-398or inhibiting the activity or reducing the amount of a nucleic acid encoding said polypeptide. 

Another embodiment of the present invention is method for identifying compounds which influence the 
activity of a polypeptide required for proliferation comprising: 

contacting a polypeptide comprising a sequence selected from the group consisting of 243-357, 359- 
398with a candidate compound; and 

determining whether said compound influences the activity of said polypeptide. 

The activity may be an enzymatic activity. The activity may be a carbon compound catabolism activity. The 
activity may be a biosynthetic activity. The activity may be a transporter activity. The activity may be a 
transcriptional activity. The activity may be a DNA replication activity. The activity may be a cell division activity. 

Another embodiment of the present invention is a compound identified using the above method. 

Another embodiment of the present invention is method for assaying compounds for the ability to reduce the 
activity or level of a polypeptide required for proliferation, comprising: 

providing a target, wherein said target comprises the coding sequence of a sequence selected from the group 
consisting of SEQ ID NOs. 82-88, 90-242; 
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contacting said target with a candidate compound; and 
measuring an activity of said target. 

The target may be a messenger RNA molecule transcribed from a coding region of one of SEQ ID. NOs.: 82- 
88, 90-242 and said activity is translation of said messenger RNA. The target may be a coding region of one of SEQ 
5 ID. NOs. 82-88, 90-242 and said activity is transcription of said messenger RNA. 

Another embodiment of the present invention is a compound identified using the method above. 
Another embodiment of the present invention is a method for identifying compounds which reduce the 
activity or level of a gene product required for cell proliferation comprising the steps of: 

expressing an antisense nucleic acid against a nucleic acid encoding said gene product in a cell to reduce the 
1 0 activity or amount of said gene product in said cell thereby producing a sensitized cell; 
contacting said sensitized cell with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized ceil. 

The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and animal 
15 cells. The cell may be an £ coll cell. The cell may be from an organism selected from the group consisting of 
Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalls. Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella 
typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botuffnum, Campylobacter 
jejuni, and Chlamydia trachomatus. Chlamydia pneumoniae or any species falling within the genera of any of the above 
species. The antisense nucleic acid may be transcribed from an inducible promoter. The method may, further comprise 
the step of contacting said cell with a concentration of inducer which induces said antisense nucleic acid to a 
sublethal level. The sub-lethal concentration of said inducer may be such that growth inhibition is 8% or more. The 
25 inducer may be isopropyl-1-thio-p-D-galactoside. The growth inhibition may be measured by monitoring optical density 
of a culture growth solution. The gene product may be a polypeptide. The gene product may be an RNA. The gene 
product may comprise a polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 
359-398. 

Another embodiment of the present invention is a compound identified using the method above. 
30 Another embodiment of the present invention is a method for inhibiting cellular proliferation comprising 

introducing a compound with activity against a gene corresponding to one of SEQ ID NOs.: 82-88, 90-242 or with 
activity against the product of said gene into a population of cells expressing a gene. The compound may be an 
antisense oligonucleotide comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-81, 405-485, 
or a proliferation inhibiting portion thereof. The proliferation inhibiting portion of one of SEQ ID NOs. 1-81, 405485 



20 
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may be a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 
consecutive bases of one of SEQ ID NOs: 1-81. 405485. The compound may be a triple helix oligonucleotide. 

Another embodiment of the present invention is a preparation comprising an effective concentration of an 
antisense oligonucleotide comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-81, 405485, 
or a proliferation-inhibiting portion thereof in a pharmaceuticaliy acceptable carrier. The proliferation-inhibiting portion 
of one of SEQ ID NOs. 1-81, 405485 may comprise at least 10, at least 20, at least 25, at least 30, at least 50 or 
more than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405 485. 

Another embodiment of the present invention is a method for inhibiting the expression of a gene in an operon 
required for proliferation comprising contacting a cell in a cell population with an antisense nucleic acid, said cell 
expressing a gene corresponding to one of SEQ 10 NOs.: 82-88, 90-242, wherein said antisense nucleic acid comprises 
at least a proliferation-inhibiting portion of said operon in an antisense orientation that is effective in inhibiting 
expression of said gene. The antisense nucleic acid may be complementary to a sequence of a gene comprising one or 
more of SEQ ID NOs.: 82-88, 90-242. The antisense nucleic acid may be a sequence of one of SEQ ID NOs.: 1-81, 
405485, or a portion thereof. The cell may be contacted with said antisense nucleic acid by introducing a plasmid 
which expresses said antisense nucleic acid into said cell population. The ceil may be contacted with said antisense 
nucleic acid by introducing a phage which expresses said antisense nucleic acid into said cell population. The cell may 
be contacted with said antisense nucleic acid by introducing a sequence encoding said antisense nucleic acid into the 
chromosome of said cell into said cell population. The cell may be contacted with said antisense nucleic acid by 
introducing a retron which expresses said antisense nucleic acid into said cell population. The cell may be contacted 
with said antisense nucleic acid by introducing a ribozyme into said cell-population, wherein a binding portion of said 
ribozyme is complementary to said antisense oligonucleotide. The ceil may be contacted with said antisense nucleic 
acid by introducing a liposome comprising said antisense oligonucleotide into said cell. The cell may be contacted with 
said antisense nucleic acid by electroporation. The antisense nucleic acid may be a fragment comprising at least 10, 
at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90- 
242. The antisense nucleic acid may be an oligonucleotide. 

Another embodiment of the present invention is a method for identifying bacterial strains comprising the 

steps of: 

providing a sample containing a bacterial species; and 

identifying a bacterial species using a species specific probe having a sequence selected from the group 
consisting of SEQ ID NOs. 1-81, 405485, 82-88, 90-242. 

Another embodiment of the present invention is a method for identifying a gene in a microorganism required 
for proliferation comprising: 

(a) identifying an inhibitory nucleic acid which inhibits the activity of a gene or gene product required for 
proliferation in a first microorganism; 

(b) contacting a second microorganism with said inhibitory nucleic acid; 
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(c) determining whether said inhibitory nucleic acid from said first microorganism inhibits proliferation of said 
second microorganism; and 

(d) identifying the gene in said second microorganism which is inhibited by said inhibitory nucleic acid. 
Another embodiment of the present invention is a method for assaying a compound for the ability to inhibit 

5 proliferation of a microorganism comprising: 

(a) identifying a gene or gene product required for proliferation in a first microorganism; 

(b) identifying a homolog of said gene or gene product in a second microorganism; 

(c) identifying an inhibitory nucleic acid sequence which inhibits the activity of said homolog in said second 
microorgansim; 

10 ld( contacting said second microorganism with a proliferation-inhibiting amount of said inhibitory nucleic acid, 

thus sensitizing said second microorganism; 

(e) contacting the sensitized microorganism of step (d) with a compound; and 

(f) determining whether said compound inhibits proliferation of said sensitized microorganism to a greater 
extent than said compound inhibits proliferation of a nonsensitized microorganism. 

15 The step of identifying a gene involved in proliferation in a first microorganism may comprise: 

introducing a nucleic acid comprising a random genomic fragment from said first microorganism operably t 
linked to a promoter wherein said random genomic fragment is in the antisense orientation; and ** 

comparing the proliferation of said first microorganism transcribing a first level of said random genomic 
fragment to the proliferation of said first microorganism transcribing a lower level of said random genomic fragment, 
20 wherein a difference in proliferation indicates that said random genomic fragment comprises a gene involved in 

proliferation. 

The step of identifying a homolog of said gene in a second microorganism may comprise identifying a 
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a database using an algorithm 
selected from the group consisting of BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 

25 algorithm with the default parameters. The step of identifying a homolog of said gene in a second microorganism may 

comprise identifying a homologous nucleic acid or a nucleic acid encoding a homologous polypeptide by identifying 
nucleic acids which hybridize to said first gene. The step of identifying a homolog of said gene in a second 
microorganism may comprise expressing a nucleic acid which inhibits the proliferation of said first microorganism in 
said second microorganism. The inhibitory nucleic acid may be an antisense nucleic acid. The inhibitory nucleic acid 

30 may comprise an antisense nucleic acid to a portion of said homolog. The inhibitory nucleic acid may comprise an 
antisense nucleic acid to a portion of the operon encoding said homolog. The step of contacting the second 
microorganism with a proliferation-inhibiting amount of said nucleic acid sequence may comprise directly contacting 
said second microorganism with said nucleic acid. The step of contacting the second microorganism with a 
proliferation-inhibiting amount of said nucleic acid sequence may comprise expressing an antisense nucleic acid to said 

35 homolog in said second microorganism. 
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Another embodiment of the present invention is a compound identified using the method above. 
Another embodiment of the present invention is a method of assaying a compound for the ability to inhibit 
proliferation comprising: 

(a) identifying an inhibitory nucleic acid sequence which inhibits the activity of a gene or gene product required 
for proliferation in a first microorgansim; 

(b) contacting a second microorganism with a proliferation-inhibiting amount of said inhibitory nucleic acid, thus 
sensitizing said second microorganism; 

(c) contacting the proliferation-inhibited microorganism of step (b) with a compound; and 

(d) determining whether said compound inhibits proliferation of said sensitized second microorganism to a 
greater extent than said compound inhibits proliferation of a nonsensitized second microorganism. 

The inhibitory nucleic acid may be an antisense nucleic acid which inhibits the proliferation of said first 
microorganism. The inhibitory nucleic acid may comprise a portion of an antisense nucleic acid which inhibits the 
proliferation of said first microorganism. The inhibitory nucleic acid may comprise an antisense molecule against the 
entire coding region of the gene involved in proliferation of the first microorganism. The inhibitory nucleic acid may 
comprise an antisense nucleic acid to a portion of the operon encoding the gene involved in proliferation of the first 
microorganism. 

Another embodiment of the present invention is a compound identified using the method above. 

Another embodiment of the present invention is a method for assaying compounds for activity against a 

biological pathway required for proliferation comprising: 

sensitizing a cell by expressing an antisense nucleic acid against a nucleic acid encoding a gene product 

required for proliferation in a cell to reduce the activity or amount of said gene product- 
contacting the sensitized cell with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of an nonsensitized cell. 

The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and animal 
cells. The ceil may be an £ coli cell. The cell may be an organism selected from the group consisting of 
Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella 
typhi. Salmonella paratyphi. Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter 
jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above 
species. The antisense nucleic acid may be transcribed from an inducible promoter. The method may further comprise 
contacting the cell with an agent which induces expression of said antisense nucleic acid from said inducible promoter, 
wherein said antisense nucleic acid is expressed at a sublethal level. The sublethal level of said antisense nucleic acid 
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may inhibit proliferation by 8% or more. The agent may be isopropyl-Mhio-p-D-galactoside (IPTG). The inhibition of 
proliferation may be measured by monitoring the optical density of a liquid culture. The gene product may comprise a 
polypeptide having a sequence selected from the group consisting of SEQ ID NOs: 243-357, 359-398. 

Another embodiment of the present invention is a compound identified using the method above. 

Another embodiment of the present invention is a method for assaying a compound for the ability to inhibit 
cellular proliferation comprising: 

contacting a cell with an agent which reduces the activity or level of a gene product required for 
proliferation of said cell- 
contacting said cell with said compound; and 

determining whether said compound reduces proliferation to a greater extent than said compound reduces 
proliferation of ceils which have not been contacted with said agent. 

The agent which reduces the activity or level of a gene product required for proliferation of said cell may 
comprise an antisense nucleic acid to a gene or operon required for proliferation. The agent which reduces the activity 
or level of a gene product required for proliferation of said ceil may comprise an antibiotic. The cell may contain a 
temperature sensitive mutation which reduces the activity or level of said gene product required for proliferation of 
said cell. The antisense nucleic acid may be directed against the same functional domain of said gene product required 
for proliferation of said cell to which said antisense nucleic acid is directed. The antisense nucleic acid may be directed 
against a different functional domain of said gene product required for proliferation of said cell than the fucntional 
domain to which said antisense nucleic acid is directed. 

Another embodiment of the present invention is a compound identified using the method above. 

Another embodiment of the present invention is a method for identifying the pathway in which a 
proliferation-required nucleic acid or its gene product lies comprising: 

expressing a sublethal level of an antisense nucleic acid directed against said proliferation-required nucleic 
acid in a cell; 

contacting said cell with an antibiotic, wherein the a biological pathway on which said antibiotic acts is 
known; and 

determining whether said cell has a substantially greater sensitivity to said antibiotic than a cell which does 
not express said sublethal level of said antisense nucleic acid. 

Another embodiment of the present invention is a method for determining the pathway on which a test 
compound acts comprising: 

(a) expressing a sublethal level of an antisense nucleic acid directed against a proliferation-required nucleic 
acid in a cell, wherein the biological pathway in which said proliferation-required nucleic acid lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell has a substantially greater sensitivity to said test compound than a cell 
which does not express said sublethal level of said antisense nucleic acid. 



WO 00/44906 PCTAJSOO/02200 
The method may further comprise: 

(d) expressing a sublethal level of a second antisense nucleic acid directed against a second proliferation-required 
nucleic acid in said cell, wherein said second proliferation-required nucleic acid is in a different biological pathway than 
said proliferation-required nucleic acid in step (a); and 

(e) determining whether said cell has a substantially greater sensitivity to said test compound than a cell which does 
not express said sublethal level of said second antisense nucleic acid. 

Another embodiment of the present invention is a purified or isolated nucleic acid consisting essentially of 
one of SEQ ID NOs: 358, 399402. 

Another embodiment of the present invention is a purified or isolated nucleic acid comprising a sequence 
selected from the group consisting of 1-81, 405485, 82-88, 90-242, 358, 399402. 

Another embodiment of the present invention is a compound which interacts with the gene or gene product 
of a nucleic acid comprising a sequence of one of SEQ ID NOs: 82-88, 90-242 to inhibit proliferation. 

Another embodiment of the present invention compound which interacts with a polypeptide comprising one 
of SEQ ID NOs. 243-357, 359-398 to inhibit proliferation. 

Another embodiment of the present invention is a compound which interacts with a nucleic acid comprising 
one of SEQ ID NOs: 358, 399402 to inhibit proliferation. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is an IPTG dose response curve in £ coli transformed with an IPTG-inducible plasmid 
containing either an antisense clone to the £ coli ribosomal protein rplW (AS-rplW) which is required for protein 
synthesis and essential cell proliferation, or an antisense clone to the elaD (AS-elaD) gene which is not known to be 
involved in protein synthesis and which is also essential for proliferation. 

Figure 2A is a tetracycline dose response curve in £ coli transformed with an IPTG-inducible 
plasmid containing antisense to rplW(AS-rplW) in the presence of 0, 20 or 50 \M IPTG. 

Figure 2B is a tetracycline dose response curve in £ coli transformed with an IPTG-inducible 
plasmid containing antisense to elaD (AS-elaD) in the presence of 0, 20 or 50 \xlA IPTG. 

Figure 3 is a graph showing the fold increase in tetracycline sensitivity of £ coli transfected with 
antisense clones to essential ribosomal proteins L23 (AS rpiW) and L7/L12 and L10 (AS-rplLrplJ). Antisense clones to 
genes known not to be involved in protein synthesis (atpB/E(AS-atpB/E ), visC (AS-visC, elaD (AS-elaD), yohH (AS- 
yohH) are much less sensitive to tetracycline. 

Definitions 

By "biological pathway" is meant any discrete cell function or process that is carried out by a gene product 
or a subset of gene products. Biological pathways include enzymatic, biochemical and metabolic pathways as well as 
pathways involved in the production of cellular structures such cell walls. Biological pathways that are usually 
required for proliferation of microorganisms include, but are not limited to, cell division, DNA synthesis & replication, 
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RNA synthesis (transcription), protein synthesis (translation), protein processing, protein transport, fatty acid 
biosynthesis, eel) wall synthesis, cell membrane synthesis & maintenance, etc 

By "inhibit activity against a gene or gene product" is meant having the ability to interfere with the function 
of a gene or gene product in such a way as to decrease expression of the gene or to reduce the level or activity of a 
product of the gene. Agents which have activity against a gene include agents that inhibit transcription of the gene 
and agents that inhibit translation of the mRNA transcribed from the gene. In microorganisms, agents which have 
activity against a gene can act to decrease expression of the operon in which the gene resides or alter the processing 
of operon RNA such as to reduce the level or activity of the gene product. The gene product can be a non-translated 
RNA such as ribosomal RNA, a translated RNA (mRNA) or the protein product resulting from translation of the gene 
mRNA. Of particular utility to the present invention are anti-sense RNAs that have activities against the operons or 
genes to which they specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the function or to reduce the level 
or activity of the gene product in a cell. 

By "activity against a protein" is meant having the ability to inhibit the function or to reduce the level or 
activity of the protein in a ceil. 

By "activity against nucleic acid" is meant having the ability to inhibit the function or to reduce the level or 
activity of the nucleic acid in a cell. 

As used herein, "sublethal" means a concentration of an agent below the concentration required to inhibit all 
cell growth. 

DETAILED DESCRIPTION OFTHF INVPNTIOM 

The present invention describes a group of £ coli genes and gene famifies required for growth and/or 
proliferation. A proSferation-required gene or gene family is one where, in the absence of a gene transcript and/or gene 
product, growth or viability of the microorganism is reduced or eliminated. Thus, as used herein the terminology 
"proliferation-required" or "required for proliferation" encompasses sequences where the absence of a gene transcript 
and/or gene product completely eliminates cell growth as well as sequences where the absence of a gene transcript and/or 
gene product merely reduces cell growth. These proliferation-required genes can be used as potential targets for the 
generation of new antimicrobial agents. To achieve that goal, the present invention also encompasses novel assays for 
analyzing proliferation-required genes and for identifying compounds which interact with the gene products of the 
proliferation-required genes. In addition, the present invention contemplates the expression of genes and the purification of 
the proteins encoded by the nucleic acid sequences identified as required proliferation genes and reported herein. The 
purified proteins can be used to generate reagents and screen small molecule libraries or other candidate compound libraries 
for compounds that can be further developed to yield novel antimicrobial compounds. The present invention also describes 
methods for identification of homologous genes in organisms other than £ colL 

The present invention utilizes a novel method to identify proliferation-required £ coli sequences. Generally, a 
library of nucleic acid sequences from a given source are subcloned or otherwise inserted into an inducible expression 
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vector, thus forming an expression library. Although the insert nucleic acids may be derived from the chromosome of the 
organism into which the expression vector is to be introduced, because the insert is not in its natural chromosomal location, 
the insert nucleic acid is an exogenous nucleic acid for the purposes of the discussion herein. The term expression is 
defined as the production of an RNA molecule from a gene, gene fragment, genomic fragment, or operon. Expression can 
also be used to refer to the process of peptide or polypeptide synthesis. An expression vector is defined as a vehicle by 
which a ribonucleic acid (RNA) sequence is transcribed from a nucleic acid sequence carried within the expression vehicle. 
The expression vector can also contain features that permit translation of a protein product from the transcribed RNA 
message expressed from the exogenous nucleic acid sequence carried by the expression vector. Accordingly, an expression 
vector can produce an RNA molecule as its sole product or the expression vector can produce a RNA molecule that is 
ultimately translated into a protein product. 

Once generated, the expression library containing the exogenous nucleic acid sequences is introduced into an £ 
coS population to search for genes that are required for bacterial proliferation. Because the library molecules ere foreign to 
the population of £ coli, the expression vectors and the nucleic acid segments contained therein are considered exogenous 
nucleic ecid. 

Expression of the exogenous nucleic acid fragments in the test population of £ coli containing the expression 
vector library is then activated. Activation of the expression vectors consists of subjecting the cells containing the vectors 
to conditions that result in the expression of the exogenous nucleic acid sequences carried by the expression vector library. 
The test population of £ coli cells is then assayed to determine the effect of expressing the exogenous nucleic acid 
fragments on the test population of cells. Those expression vectors that, upon activation and expression, negatively 
impact the growth of the £ coff screen population were identified, isolated, and purified for further study. 

A variety of assays are contemplated to identify nucleic acid sequences that negatively impact growth upon 
expression. In one embodiment growth in £ coli cultures expressing exogenous nucleic ecid sequences and growth in 
cultures not expressing these sequences is compared. Growth measurements are assayed by examining the extent of 
growth by measuring optical densities. Alternatively, enzymatic assays can be used to measure bacterial growth rates 
to identify exogenous nucleic acid sequences of interest. Colony size, colony morphology, and cell morphology are 
additional factors used to evaluate growth of the host cells. Those cultures that failed to grow or grow with reduced 
efficiency under expression conditions are identified as containing an expression vector encoding a nucleic acid fragment 
that negatively affects a proliferation-required gene. 

Once exogenous nucleic acid sequences of interest are identified, they are analyzed. The first step of the 
analysis is to acquire the nucleic acid sequence of the nucleic acid fragment of interest. To achieve this end, the insert in 
those expression vectors identified as containing a sequence of interest is sequenced, using standard techniques well 
known in the art The next step of the process is to determine the source of the nucleic acid sequence. 

Determination of sequence source is achieved by comparing the obtained sequence data with known sequences 
in various genetic databases. The sequences identified are used to probe these gene databases. The result of this 
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procedure is a list of exogenous nucleic add sequences corresponding to a list that includeds novel bacterial genes required 
for proliferation as well as genes previously identified as required for proliferation. 

The number of DNA and protein sequences available in database systems has been growing exponentially for 
years. For example, at the end of 1998, the complete sequences of Caenortiab&tis elegans, Saccharomyces cerevistae and 
nineteen bacterial genomes, including £ ra# were available. This sequence information is stored in a number of databanks, 
such as GenBank (the National Center for Biotechnology Information (NCBI), and is publicly ava8able for searching. 

A variety of computer programs are available to assist in the analysis of the sequences stored within these 
databases. FastA, (W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP and FASTA" 
Methods in Enzymology 183:63- 98), Sequence Retrieval System (SRS), (Etzold & Argos, SRS an indexing and 
retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9:49-57, 1993) are two examples of computer programs 
that can be used to analyze sequences of interest. In one embodiment of the present invention, the BLAST family of 
computer programs, which includes BLASTN version 2.0 with the default parameters, or BLASTX version 2.0 with the 
default parameters, is used to analyze nucleic acid sequences. 

BLAST, an acronym for "Basic Local Alignment Search Tool," is a family of programs for database similarity 
searching. The BLAST family of programs includes: BLASTN, a nucleotide sequence database searching program, 
BLASTX, a protein database searching program where the input is a nucleic acid sequence; and BLASTP, a protein 
database searching program. BLAST programs embody a fast algorithm for sequence matching, rigorous statistical 
methods for judging the significance of matches, and various options for tailoring the program for special situations. 
Assistance in using the program can be obtained by e-mail at blast@ncbi.nlm.nih.oov. 

Bacterial genes are often transcribed in polycistronic groups. These groups comprise operons, which are a 
coDection of genes and intergenic sequences. The genes of an operon are co-transcribed and are often related functionally. 
Given the nature of the screening protocol, it is possible that the identified exogenous nucleic acid sequence corresponds to 
a gene or portion thereof with or without adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a 
gene), an intergenic sequence (i.e. a sequence between genes), a sequence spanning at least a portion of two or more 
genes, a 5' noncoding region or a 3' noncoding region located upstream or downstream from the actual sequence that is 
required for bacterial proliferation. Accordingly/determining which of the genes that are encoded within the operans are 
individually required for proliferation is often desirable. 

In one embodiment of the present invention, an operon is dissected to determine which gene or genes are required 
for proliferation. For example, the RegulonDB DataBase described by Huerta et aL \filud Acids Res. 26:55-59, 1998), 
which may also be found on the website http://www.cifn.unam.mx/ComputationalJiology/regulondbf, may be used, to 
identify the boundaries of operons encoded within microbial genomes. A number of techniques that are well known in the 
art can be used to dissect the operon. In one aspect of this embodiment, gene disruption by homologous recombination is 
used to individuaOy inactivate the genes of an operon that is thought to contain a gene required for proliferation. 

Several gene disruption techniques have been described for the replacement of a functional gene with a 
mutated, non-functional (null) allele. These techniques generally involve the use of homologous recombination. The 
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method described by Link et al. (J. Bacteriol 1997 179:6228; incorporated herein by reference in it's entirety) serves 
as an excellent example of these methods as applicable to disruption of genes in £ colL This technique uses crossover 
PCR to create a null allele with an m-frame deletion of the coding region of a target gene. The null allele is 
constructed in such a way that sequences adjacent to the wild type gene (ca. 500 bp) are retained. These 
homologous sequences surrounding the deletion null allele provide targets for homologous recombination so that the 
wild type gene on the £ a?// chromosome can be replaced by the constructed null allele. 

The crossover PCR amplification product is subcloned into the vector pK03, the features of which include a 
chloramphenicol resistance gene, the counter-selectable marker sacB, and a temperature sensitive autonomous 
replication function. Following transformation of an £ caff cell population with such a vector, selection for cells that 
have undergone homologous recombination of the vector into the chromosome is achieved by growth on 
chloramphenicol at the non-permissive temperature of 43°C. Under these conditions, autonomous replication of the 
plasmid cannot occur and cell are resistant to chloramphinicol only if the chloramphenicol resistance gene has been 
integrated into the chromosome. Usually a single crossover event is responsible for this integration event such that 
the £ colt chromosome now contains a tandem duplication of the target gene consisting of one wild type allele and 
one deletion null allele separated by vector sequence. 

This new £ oxf/ strain containing the tandem duplication can be maintained at permissive temperatures in 
the presence of drug selection (chloramphenicol). Subsequently, cells of this new strain are cultured at the permissive 
temperature 30°C without drug selection. Under these conditions, the chromosome of some of the cells within the 
population will have undergone an internal homologous recombination event resulting in removal of the plasmid 
sequences. Subsequent culturing of the strain in growth medium lacking chloramphenicol but containing sucrose is 
used to select for such recombinative resolutions. In the presence of the counter-selectable marker sacB, sucrose is 
rendered into a toxic metabolite. Thus, cells that survive this counter-selection have lost both the plasmid sequences 
from the chromosome and the autonomously replicating plasmid that results as a byproduct of recombinative 
resolution. 

There are two possible outcomes of the above recombinative resolution via homologous recombination. 
Either the wild type copy of the targeted gene is retained on the chromosome or the mutated null allele is retained on 
the chromosome. In the case of an essential gene, a single copy of the null allele would be lethal and such cells should 
not be obtained by the above procedure when applied to essential genes. In the case of a non-essential gene, roughly 
equal numbers of cells containing null alleles and cells containing wild type alleles should be obtained. Thus, the 
method serves as a test for essentiality of the targeted gene: when applied to essential genes, only cells with a wild 
type allele on the chromosome will be obtained. 

Other techniques have also been described for the creation of disruption mutations in £ cofi. For example, 
Link et al. also describe inserting an inframe sequence tag concommitantly with an in-frame deletion in order to 
simplify analysis of recombinants obtained. Further, Link et al. describe disruption of genes with a drug resistance 
marker such as a kanamycin resistance gene. Arigoni et al., (Arigoni, F. et al. A Genome-based Approach for the 
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Identification of Essential Bacterial Genes. Nature Biotechnology 16: 851-856. the disclosure of which is incorporated 
herein by reference in its entirety) describe the use of gene disruption combined with engineering a second copy of a 
test gene such that the expression of the gene is regulated by and inducible promoter such as the arabinose promoter 
to test the essentiality of the gene. Many of these techniques result in the insertion of large fragments of DMA into 
the gene of interest, such as a drug selection marker. An advantage of the technique described by Link et al. is that it 
does not rely on an insertion into the gene to cause a functional defect, but rather results in the precise removal of the 
coding region. This insures the lack of polar effects on the expression of genes downstream from the target gene. 

Recombinant DNA techniques can be used to express the entire coding sequences of the gene identified as 
required for proliferation, or portions thereof. The over-expressed proteins can be used as reagents for further study. The 
identified exogenous sequences are isolated, purified, and cloned into a suitable expression vector using methods well 
known in the art. If desired, the nucleic acids can contain the sequences encoding a signal peptide to facilitate secretion of 
the expressed protein. 

Expression of fragments of the bacterial genes identified as required for proliferation is also contemplated by the 
present invention. The fragments of the identified genes can encode a polypeptide comprising at least 5, at least 10. at 
least 15. at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60. at 
least 65. at least 75, or more than 75 consecutive amino acids of a gene complementary to one of the identified sequences 
of the present invention. The nucleic acids inserted into the expression vectors can also contain sequences upstream and 
downstream of the coding sequence. 

When expressing the coding sequence of an entire gene identified as required for bacterial proliferation or a 
fragment thereof, the nucleic acid sequence to be expressed is operably linked to a promoter in an expression vector using 
conventional cloning technology. The expression vector can be any of the bacterial, insect, yeast, or mammalian expression 
systems known in the art. Commercially available vectors and expression systems are available from a variety of suppliers 
including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and 
Invitrogen (San Diego, California). If desired, to enhance expression and facilitate proper protein folding, the codon usage 
and codon bias of the sequence can be optimized for the particular expression organism in which the expression vector is 
introduced, as explained by Hatfield, et al., U.S. Patent No. 5,082,767, incorporated herein by this reference. Fusion 
protein expression systems are also contemplated by the present invention. 

Following expression of the protein encoded by the identified exogenous nucleic acid sequence, the protein is 
purified. Protein purification techniques are well known in the art. Proteins encoded and expressed from identified 
exogenous nucleic acid sequences can be partially purified using precipitation techniques, such as precipitation with 
polyethylene glycol. Chromatographic methods usable with the present invention can include ion-exchange 
chromatography, gel filtration, use of hydroxyapaptite columns, immobilized reactive dyes, chromatofocusing, and use of 
high-performance liquid chromatography. Electrophoretic methods such one-dimensional gel electrophoresis, high-resolution 
two-dimensional polyacrylamide electrophoresis, isoelectric focusing, and others are contemplated as purification methods. 
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Also, affinity chromatographic methods, comprising antibody columns, ligand presenting columns and other affinity 
chromatographic matrices are contemplated as purification methods in the present invention. 

The purified proteins produced from the gene coding sequences identified as required for proliferation can be used 
in a variety of protocols to generate useful antimicrobial reagents. In one embodiment of the present invention, antibodies 
are generated against the proteins expressed from the identified exogenous nucleic acid sequences. Both monoclonal and 
polyclonal antibodies can be generated against the expressed proteins. Methods for generating monoclonal and polyclonal 
antibodies are well known in the art. Also, antibody fragment preparations prepared from the produced antibodies 
discussed above are contemplated. 

Another application for the purified proteins of the present invention is to screen small molecule libraries for 
candidate compounds active against the various target proteins of the present invention. Advances in the field of 
combinatorial chemistry provide methods, well known in the art, to produce large numbers of candidate compounds that 
can have a binding, or otherwise inhibitory effect on a target protein. Accordingly, the screening of small molecule libraries 
for compounds with binding affinity or inhibitory activity for a target protein produced from an identified gene sequence is 
contemplated by the present invention. 

The present invention further contemplates utility against a variety of other pathogenic organisms in addition to 
£ caff. For example, the invention has utility in identifying genes required for proliferation in prokaryotes and eukaryotes. 
For example, the invention has utility with protists, such as Plasmodium spp.; plants; animals, such as Entamoeba spp. 
and Contracaecum spp; and fungi including Candida spp., (e.g., Candida albicans), Saccharomyces cerevisiae, 
Cryptococcus neoformans, and Aspergillus fumigatus. In one embodiment of the present invention, monera, specifically 
bacteria are probed in search of novel gene sequences required for prolif eration. This embodiment is particularly important 
given the rise of drug resistant bacteria. 

The numbers of bacterial species that are becoming resistant to existing antibiotics are growing. A partial list of 
these organisms includes: Staphylococcus spp., such as S. aureus; Enterococcus spp., such as £ faecalis; 
Pseudomonas spp., such as P. aeruginosa, Clostridium spp. f such as C. botulinum, Haemophilus spp., such as //. 
influenzae, Enterobacter spp., such as £ cloacae, Vibrio spp., such as V. cholera; Moraxala spp., such as M. 
catarrhal; Streptococcus spp., such as S. pneumoniae. Neisseria spp., such as N. gonorrhoeae; Mycoplasma spp., 
such as Mycoplasma pneumoniae; Salmonella typhfmurium; Helicobacter pylorr, Escherichia coll; and Mycobacterium 
tuberculosis. The sequences identified as required for proliferation in the present invention can be used to probe these 
and other organisms to identify homologous required proliferation genes contained therein. 

In one embodiment of the present invention, the nucleic acid sequences disclosed herein are used to screen 
genomic libraries generated from bacterial species of interest other than £ col For example, the genomic library may be 
from Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, 
Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, 
Salmonella typhi, Saimoneila paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium 
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tuberculosis, Mycobacterium leprae. Treponema pallidum, Bacillus anthracis. Yersinia pestis, Clostridium botuBnum, 
Campylobacter jejuni. Chlamydia trachomatus. Chlamydia pneumoniae or any species falBng within the genera of any 
of the above species. Standard molecular biology techniques are used to generate genomic libraries from various 
microorganisms. In one aspect, the libraries are generated and bound to nitrocellulose paper. The identified exogenous 
nucleic acid sequences of the present invention can then be used as probes to screen the libraries for homologous 
sequences. The homologous sequences identified can then be used as targets for the identification of new. antimicrobial 
compounds with activity against more than one organism. 

For example, the preceding methods may be used to isolate nucleic acids having a sequence with at least 
97%. at least 95%. at least 90%, at least 85%, at least 80%, or at least 70% identity to a nucleic acid sequence 
selected from the group consisting of one of the sequences of SEQ ID NOS. 1-81. 405485, 82-88, 90-242, fragments 
comprising at least 10, 15, 20. 25. 30. 35, 40, 50. 75. 100, 150. 200. 300, 400. or 500 consecutive bases thereof, and 
the sequences complementary thereto. Identity may be measured using BLASTN version 2.0 with the default 
parameters. (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search 
Programs. Nucleic Acid Res. 25: 3389-3402 (1997), the disclosure of which is incorporated herein by reference in its 
entirety). For example, the homologous polynucleotides may have a coding sequence which is a naturally occurring 
allelic variant of one of the coding sequences described herein. Such allelic variants may have a substitution, deletion 
or addition of one or more nucleotides when compared to the nucleic acids of SEO ID NOs: 1-81, 405485, 82-88. 90- 
242 or the sequences complementary thereto. 

Additionally, the above procedures may be used to isolate nucleic acids which encode polypeptides having at 
least 99%. 95%. at least 90%, at least 85%, at least 80%. at least 70%, at least 60%. at least 50%, or at least 40% 
identity or similarity to a polypeptide having the sequence of one of SEO ID NOs: 243-357, 359-398or fragments 
comprising at least 5. 10. 15. 20. 25. 30, 35, 40. 50. 75, 100, or 150 consecutive amino acids thereof as determined 
using the FASTA version 3.0t78 algorithm with the default parameters. Alternatively, protein identity or similarity 
may be identified using BLASTP with the default parameters, BLASTX with the default parameters, or TBLASTN with 
the default parameters. (Alschul. S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein Database 
Search Programs, Nucleic Acid Res. 25: 3389-3402 (1997), the disclosure of which is incorporated herein by 
reference in its entirety). 

Alternatively, homologous nucleic acids or polypeptides may be identified by searching a database to identify 
sequences having a desired level of homology to a nucleic acid or polypeptide involved in proliferation or an antisense 
nucleic acid to a nucleic acid involved in microbial proliferation. A variety of such databases are available to those 
skilled in the art, including GenBank and GenSeq. In some embodiments, the databases are screened to identify 
nucleic acids or polypeptides having at least 97%, at least 95%. at least 90%, at least 85%, at least 80%. at least 
70%. at least 60%, or at least 50%. at least 40% identity or similarity to a nucleic acid or polypeptide involved in 
proliferation or an antisense nucleic acid involved in proliferation. For example, the database may be screened to 
identify nucleic acids homologous to one of SEO ID Nos. 1-81, 405485, 82-88, 90-242 or polypeptides homologous 
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to SEQ ID NOs. 243-357, 359 398. In some embodiments, the database may be screened to identify homologous 
nucleic acids or polypeptides from organisms other than £ cot/, including organisms such as Staphylococcus aureus, 
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecahs, 
Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida 
albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella 
paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, 
Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni,Chlamydia 
trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

In another embodiment, gene expression arrays and microarrays can be employed. Gene expression arrays 
are high density arrays of DNA samples deposited at specific locations on a glass chip, nylon membrane, or the like. 
Such arrays can be used by researchers to quantify relative gene expression under different conditions. Gene 
expression arrays are used by researchers to help identify optimal drug targets, profile new compounds, and determine 
disease pathways. An example of this technology is found in U.S. Patent No. 5807522, which is hereby incorporated 
by reference. 

It is possible to study the expression of ail genes in the genome of a particular microbial organism using a 
single array. For example, the arrays from Genosys consist of 12 x 24 cm nylon filters containing PCR products 
corresponding to 4290 ORFs from £ coll 10 ngs of each are spotted every 1.5 mm on the filter. Single stranded 
labeled cDNAs are prepared for hybridization to the array (no second strand synthesis or amplification step is done) 
and placed in contact with the filter. Thus the labeled cDNAs are of "antisense" orientation. Quantitative analysis is 
done by phosphorimager. 

Hybridization of cONA made from a sample of total cell mRNA to such an array followed by detection of 
binding by one or more of various techniques known to those in the art results in a signal at each location on the array 
to which cDNA hybridized. The intensity of the hybridization signal obtained at each location m the array thus reflects 
the amount of mRNA for that specific gene that was present in the sample. Comparing the results obtained for mRNA 
isolated from ceils grown under different conditions thus allows for a comparison of the relative amount of expression 
of each individual gene during growth under the different conditions. 

Gene expression arrays may be used to analyze the total mRNA expression pattern at various time points 
after induction of an antisense nucleic acid against a proliferation-required gene. Analysis of the expression pattern 
indicated by hybridization to the array provides information on whether or not the target gene of the antisense nucleic 
acid is being affected by antisense induction, how quickly the antisense is affecting the target gene, and for later 
timepoints, what other genes are affected by antisense expression. For example, if the antisense is directed against a 
gene for ribosomal protein L7/L12 in the 50S subunit, its targeted mRNA may disappear first and then other mRNAs 
may be observed to increase, decrease or stay the same. Similarly, if the antisense is directed against a different 50S 
subunit ribosomal protein mRNA (e.g. L25), that mRNA may disappear first followed by changes in mRNA expression 
that are similar to those seen with the L7/L12 antisense expression. Thus, the mRNA expression pattern observed 
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with an antinsense nucleic acid against a proliferation required gene may identify other proliferation-required nucleic 
acids in the same pathway as the target of the antisense nucleic acid. In addition, the mRNA expression patterns 
observed with candidate drug compounds may be compared to those observed with antisense nucleic acids against a 
proliferation-required nucleic acid. If the mRNA expression pattern observed with the candidate drug compound is 
similar to that observed with the antisense nucleic acid, the drug compound may be a promising therapeutic candidate. 
Thus, the assay would be useful in assisting in the selection of candidate drug compounds for use in screening 
methods such as those described below. 

In cases where the source of nucleic acid deposited on the array and the source of the nucleic acid being 
hybridized to the array are from two different organisms, gene expression arrays can identify homologous genes in the 
two organisms. 

The present invention also contemplates additional methods for screening other microorganisms for profiferation- 
required genes. In this embodiment, the conserved portions of sequences identified as proliferation-required can be used to 
generate degenerate primers for use in the polymerase chain reaction (PCR). The PCR technique is well known in the art. 
The successful production of a PCR product using degenerate probes generated from the sequences identified herein would 
indicate the presence of a homologous gene sequence in the species being screened. This homologous gene is then 
isolated, expressed, and used as a target for candidate antibiotic compounds. In another aspect of this embodiment the 
homologous gene is expressed in an autologous organism or in a heterologous organism in such a way as to alter the level 
or activity of a homologous gene required for proliferation in the autologous or heterologus organism. In still another aspect 
of this embodiment, the homologous gene or portion is expressed in an antisense orientation in such a way as to alter the 
level or activity of a nucleic acid required for proliferation of an autologous or heterologous organism. 

The homologous sequences to proliferation-required genes identified using the techniques described herein 
may be used to identify proliferation-required genes of organisms other than £ coli, to inhibit the proliferation of 
organisms other than £ coli by inhibiting the activity or reducing the amount of the identified homologous nucleic acid 
or polypeptide in the organism other than £ coli, or to identify compounds which inhibit the growth of organisms other 
than E. coli as described below. 

In another embodiment of the present invention, £ coli sequences identified as required for proliferation are 
transferred to expression vectors capable of function within non£ coli species. As would be appreciated by one of 
ordinary skill in the art, expression vectors must contain certain elements that are species specific. These elements 
can include promoter sequences, operator sequences, repressor genes, origins of replication, ribosomal binding 
sequences, termination sequences, and others. To use the identified exogenous sequences of the present invention, 
one of ordinary skill in the art would know to usb standard molecular biology techniques to isolate vectors containing 
the sequences of interest from cultured bacterial cells, isolate and purify those sequences, and subclone those 
sequences into an expression vector adapted for use in the species of bacteria to be screened. 

Expression vectors for a variety of other species are known in the art. For example, Cao et al. report the 
expression of steroid receptor fragments in Staphylococcus aureus. J. Steroid Biochem Mol Biol. 44(1):M1 
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(1993). Also, Pla et al. have reported an expression vector that is functional in a number of relevant hosts including: 
Salmonella typhimurium, Pseudomonas putida, and Pseudomonas aeruginosa. J. Bacteriol. 1 72(8):4448-55 (1990). 
These examples demonstrate the existence of molecular biology techniques capable of constructing expression vectors 
for the species of bacteria of interest to the present invention. 

Following the subcloning of the identified nucleic acid sequences into an expression vector functional in the 
microorganism of interest, the identified nucleic acid sequences are conditionally transcribed to assay for bacterial 
growth inhibition. Those expression vectors found to contain sequences that, when transcribed, inhibit bacterial 
growth are compared to the known genomic sequence of the pathogenic microorganism being screened or, if the 
homologous sequence from the organism being screened is not known, it may be identified and isolated by 
hybridization to the proliferation-required £ coll sequence interest or by amplification using primers based on the 
proliferation-required £ coli sequence of interest as described above. 

The antisense sequences from the second organism which are identified as described above may then be 
operably linked to a promoter, such as an inducible promoter, and introduced into the second organism. The 
techniques described herein for identifying £ cot genes required for proliferation may thus be employed to determine 
whether the identified sequences from a second organism inhibit the proliferation of the second organism. 

Antisense nucleic acids required for the proliferation of organisms other than £ coli or the genes 
corresponding thereto, may also be hybridized to a microarray containing the £ co/ZORFs to gauge the homology 
between the £ coli sequences and the proliferation-required nucleic acids from other organisms. For example, the 
proliferation-required nucleic ecid may be from Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter 
cloacae, Helicobacter pylori. Neisseria gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus 
influenzae. Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, 
Aspergillus lumigatus, Klebsiella pneumoniae, Salmonella typhi. Salmonella paratyphi. Salmonella cholerasuis, 
Staphylococcus epidermidis. Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, bacillus 
anthracis. Yersinia pestis, Clostridium botulinum, Campylobacter jejuni or Chlamydia trachomatus. Chlamydia 
pneumoniae or any species falling within the genera of any of the ebove species. The proliferation-required nucleic 
acids from an organism other than £ a?// may be hybridized to the array under a variety of conditions which permit 
hybridization to occur when the probe has different levels of homology to the sequence on the microarray. This would 
provide an indication of homology across the organisms as well as clues to other possible essential genes in these 
organisms. 

In still another embodiment, the exogenous nucleic acid sequences of the present invention that are identified as 
required for bacterial growth or proliferation can be used as antisense therapeutics for killing bacteria. The antisense 
sequences can be directed against the proliferation-required genes whose sequence corresponds to the exogenous nucleic 
add probes identified here (i.e. the antisense nucleic acid may hybridize to the gene or a portion thereof). Alternatively, 
antisense therapeutics can be directed egainst operons in which proliferation-required genes reside (i.e. the antisense 
nucleic acid may hybridize to any gene in the operon in which the proliferation-required genes reside). Further, antisense 
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therapeutics can be directed against a profiferation-required gene or portion thereof with or without adjacent noncoding 
sequences, an intragenic sequence (Le. a sequence within a gene), an allergenic sequence (Le. a sequence between gunes), 
a sequence spanning at least a portion of two or more genes, a 5" noncoding region or a 3' noncoding region located 
upstream or downstream from the actual sequence that is required for bacterial proliferation or an operon containing a 
proliferation-required gene. 

In addition to therapeutic applications, the present invention encompasses the use of nucleic acid sequences 
complementary to sequences required for proliferation as diagnostic tools. For example, nucleic acid probes complementary 
to proliferation-required sequences that are specific for particular species of microorganisms can be used as probes to 
identify particular microorganism species in clinical specimens. This utility provides a rapid and dependable method by 
which to identify the causative agent or agents of a bacterial infection. This utility would provide clinicians the ability to 
prescribe species specific antimicrobial compounds to treat such infections. In an extension of this utility, antibodies 
generated against proteins translated from mRNA transcribed from proliferation-required sequences can also be used to 
screen for specific microorganisms that produce such proteins in a species-specific manner. 

The following examples teach the genes of the present invention and a subset of uses for the £ coli genes 
identified as required for proliferation. These examples are iUustrative only and are not intended to limit the scope of the 
present invention. 

EXAMPLES 

The following examples are directed to the identification and exploitation of £ coli genes required for 
proliferation. Methods of gene identification are discussed as well as a variety of methods to utilize the identified 
sequences. 

Genes Identified as Required for Proliferation of £ coli 

Exogenous nucleic acid sequences were cloned into an inducible expression vector and assayed for growth 
inhibition activity. Example 1 describes the examination of a library of exogenous nucleic acid sequences cloned into IPTG- 
inducible expression vectors. Upon activation or induction, the expression vectors produced an RNA molecule 
corresponding to the subcloned exogenous nucleic acid sequences. The RNA product was in an antisense orientation with 
respect to the £ coB genes from which it was originally derived. This antisense RNA then interacted with sense mRNA 
produced from various £ coli genes and interfered with or inhibited the translation of the sense messenger RNA (mRNA) 
thus preventing protein production from these sense mRNA molecules. In cases where the sense mRNA encoded a protein 
required for the proliferation, bacterial cells containing an activated expression vector failed to grow or grew at a 
substantially reduced rate. 

EXAMPLE 1 

Inhibition of Bacterial Proliferation after IPTG induction 
To study the effects of transcriptional induction in liquid medium, growth curves were carried out by back 
diluting cultures 1:200 into fresh media with or without 1 mM IPTG and measuring the 0D 4H every 30 minutes (min). To 
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study the effects of transcriptional induction on solid medium. 10 2 . 10 3 , 10\ 10 5 , 10", 10 7 and 10* fold dfotions of 
overnight cultures were prepared. Aliquots of from 0.5 to 3 pi of these dilutions were spotted on selective agar plates 
with or without 1 mM IPTG. After overnight incubation, the plates were compared to assess the sensitivity of the clones 
to IPTG. 

Of the numerous clones tested, some clones were identified as a containing sequence that inhibited £ coli 
growth after IPTG induction. Accordingly, the gene to which the inserted nucleic acid sequence corresponds, or a gene 
within the operon containing the inserted nucleic acid, may be required for proliferation inf. coli. 

Characterization of Isolated Clones Negatively Affecting £ coli Proliferation 

Following the identification of those expression vectors that, upon expression, negatively impacted £ ntfgrowth 
or proliferation, the inserts or nucleic acid fragments contained in those expression vectors were isolated for subsequent 
characterization. Expression vectors of interest were subjected to nucleic acid sequence determination. 

EXAMPLE 2 

Nucleic Acid Sequence Determination of Identified Clones Expression Mucfeir AriH Fragments with Omental EffP«« nf F 

co//Proliferation 

The nucleotide sequences for the exogenous identified sequences were determined using plasmid DMA isolated 
using QIAPREP (Qiagen, Valencia, CA) and methods supplied by the manufacturer. The primers used for sequencing the 
inserts were 5' • TGTTTATCAGACCGCTT • 3* (SEQ ID NO: 403) and 5' • ACAATTTCACACAGCCTC - 3' (SEQ ID NO: 404). 
These sequences flank the polyKnker in pLEX5BA. Sequence identification numbers (SEQ ID NOs) for the identified inserts 
are listed in Table I and discussed below. 

EXAMPLE 3 

Comparison Of Isolated Sequences to Known Senuences 
The nucleic acid sequences of the subcloned fragments obtained from the expression vectors discussed above 
were compared to known £ coli sequences in GenBank using BLAST version 1.4 or version 2.0.6 using the following 
default parameters: Filtering off, cost to open a gap-5, cost to extend a gap-2, penalty for a mismatch in the blast 
portion of run -3, reward for a match in the blast portion of run- 1, expectation value |e)- 1 0.0. word size- 1 1. number of 
one-line descriptions- 100, number of alignments to show |B)-100. BLAST is described in Aitschul, J Mol Biol. 215:403- 
10 (1990), the disclosure of which is incorporated herein by reference in its entirety. Expression vectors were found to 
contain nucleic acid sequences in both the sense and antisense orientations. The presence of known genes, open reading 
frames, and ribosome binding sites was determined by comparison to public databases holding genetic information and 
various computer programs such as the Genetics Computer Group programs FRAMES and CODONPREFERENCE. Clones 
were designated as "antisense" if the cloned fragment was oriented to the promoter such that the RNA transcript 
produced was complementary to the expressed mRNA from a chromosomal locus. Clones were designated as "sense" if 
they coded for an RNA fragment that was identical to a portion of a wild type mRNA from a chromosomal locus. 
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The sequences described in Examples 1-2 that inhibited bacterial profiferation and contained gene fragments in 
an antisense orientation are Dsted in Table I. This table lists each identified sequence by: a sequence identification number; 
a Molecule Number; a gene to which the identified sequence corresponds, listed according to the National Center for 
Biotechnology Information (NCBI), Blattner (Science 277:1453-1474(1997); also contains the £ cofi K-12 genome 
5 sequence), or Rudd (Micro, and Mol. Rev. 62:985-1019 (1998)1 (both papers are hereby incorporated by reference) 

nomenclatures. The CONTIG numbers for each identified sequence is shown, as wefl as the location of the first and last 
base pairs located on the £ cofi chromosome. A Molecule Number with a indicates a done corresponding to an 
intergenic sequence. 

The sequences of the nucleic acid inserts of SEQ ID NOs: 1-81 from U.S. Provisional Patent Application No. 
1 0 60/1 1 7,405 which inhibited proliferation were further analyzed. The reanalyzed sequences corresponding to SEQ ID NOs. 

1-81 of U.S. Provisional Patent Application No. 60/1 17,405 have SEQ ID NOs. 405-485 in the present application. 

SEQ ID NOs: 82-242 in U.S. Provisional Patent Application No. 60/117,405 are identical to SEQ ID NOs: 82- 
242 of the present application with the following exceptions. SEQ ID NO: 148 in the present application is the 
complementary strand of SEQ ID NO: 148 in U.S. Provisional Patent Application No. 60/117,405. Accordingly, the 
15 protein of SEQ ID NO: 308 which is encoded by SEQ ID NO: 148 has also been revised. SEQ ID NO: 163 in the present 

application is the complementary strand of SEQ ID NO: 163 in U.S. Provisional Patent Application No. 60/117,405. 
Accordingly, the protein of SEQ ID NO: 323 which is encoded by SEQ ID NO: 163 has also been revised. 

The target gene of SEQ ID NOs. 18 and 19 of U.S. Provisional Patent Application No. 60/117,405 (SEQ ID 
NOs. 18, 19, 422, 423 of the present application) has been revised from dicF to ftsZ to reflect the fact that these 
20 SEQ ID NOs. include natural antisense molecules which inhibit ftsZ expression. 

The gene products of the nucleic acids of SEQ ID NOs. 198 and 239-242 in U.S. Provisional Patent 
Application No. 60/117,405 and in the present application (SEQ ID NOs. 358 and 399402 of the present application) 
have been revised to reflect the fact that these nucleic acids encode nontranslated tRNAs and rRNAs. Tables I and II 
have been revised accordingly. The SEQ ID NOs. in Table II were also revised to reflect the fact that SEQ ID NOs: 89 
25 and 402 were identical in U.S. Provisional Patent Application No. 60/1 1 7,405. 

TABLE I 

Identified Clones with Corresponding Genes and Operons 



SEQ ID 


Molecule No. 


Gene 


Gene 


Gene 


CONTIG 


NO. 




(NCBI) 


(Blattner) 


(Budd) 




1,405 


EcXAOOl 


yhhQ 


b3471 


yhhQ 


AE000423 


2,406 


EcXA002 


lepB 


lepB 


lepB 


AE000343 


3,407 


EcXA003 


f586 


b0955 


ycbZ 


AE000197 


4, 408 


EcXA004 


rpsG, rpsL 


b3341 


rpsG, rpsL 


AE000410 


5, 409 


EcXA005a 


rpIL, rplJ 


b39BB 


rpIL, rpU 


AE000472 


6,410 


EcXA005b 


rpIL 


rp/L 


rp/L 


AE000472 


7,411 


EcXA005c 


rpIL, rplJ 


rplLrpU 


rp/L, rplJ 


AE000472 


8,412 


EcXA005d 


rpIL, rpU 


rpIL, rplJ 


rp/L, rp/J 


AE000472 


9,413 


EcXA005e 


rp/L 


rpIL 


rp/L 


AE000472 
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SEQID 

NO. 


Molecule No. 


i Gene 
(NCBI) 


Gene 
(Blattner) 


Gene 
(Rudd) 


UUH 1 111 


10,414 


EcXA005 


f rpll 


rpli 


rpli 


MtUUU*t/ 1, 


11,415 


EcXA005c 


nil 


rpll 


rpll 


AF0flQ47? 


12,416 


EcXAOOG 


pti 


' b229) 


' Ptc 


nLUUUJ 1 %3 


13,417 


EcXA007 


yicf 


1 b366t 


yicf 


p AE000444 


14,418 


EcXA008a 


yhab 


f \ b312? 


r yhab 


' ApnnmoA 

nCUUUuW 


15,419 


EcXAOOBb 


' yhab 


' yhab 


' vhab 




16, 420 


EcXAOOSc 


yhab 


' yhab 


JrflaU 




17.421 


EcXA009 


ydelf 


ydeY 


vrlpY 

VUG 1 


MCUUU/H3 


18, 422 


EcXA010a 
(natural as) 


dicF 


b1575 


tiirf 


MtUUUZOJ 


19,423 


EcXAOlOb 


dicF 


dicF 


d/cf 


AEQD09R'? 


20,424 


EcXA011 


fdnG 


b!474 


fdnG 




21,425 


EcXA012a 


fusA 


b334Q 


ft is A 


MCUUUf I U 


22,426 


EcXA012b 


fusA 


fusA 


fusA 


APnnnAin 


23, 427 


EcXA012c 


fu$A 


fusA 


fusA 


AFnnrum 


24, 428 


EcXA013a 


086 


b2562 


yfhL 


AE00034? 


25, 429 


_ EcXA013h 


086 


b2562 


yfhL 


AE0QQ34? 


26, 430 


EcXA013c 


086 


b2562 


yfhL 


AEQQ034? 


27, 431 


EcXA014 


visC 


b2906 


visC 


nLUUUJ/ H 


28, 432 


EcXA015 


yfdl 


yfdl 


yfdl 




29, 433 


EcXA016 


ysaQ 


yeaQ 


yeaQ 








yoaG 


yoaG 


r uau 




30, 434 


EcXA017a 


ygtf 


b2922 


vooF 


AFDnni7R 

nLUUUJ/ 9 


31,435 


EcXAOUb 


V99E 


vqqE 


vooF 


AFnnrrc75 


32,436 


EcXA018a 


0464 


b2Q74 


vboM 


AFonn9Q7 

nkUUlA&EJ / 


33,437 


EcXA018b 


o464 


b2074 


venM 


AF(inn7Q7 


34,438 


EcXA019a 


yehA 


yehA 


yehA 


AFnnrnnn 












AE0002QP 


35,439 


EcXA019b 


of 72, yehA 


of 72, yehA 


of 72 vshA 


AFnnn?QQ 


36,440 


EcXA020 


o384 f f82 


b!794 t b1795 


veaP veaO 


AFfinn974 


37,441 


EcXA021a 


ffJ2 


b02J8 


yafU 


nLUUU 1 JU 


38,442 


EcXA021b 


f112 


b0218 


vaffl 


Apnnnnn 


39,443 


EcXA022 


o740 


hi 629 


vrinN 
Yuytv 


AcnnnORQ 
HCUUU£gO 


40,444 


EcXA023a 


f176, f382 


b1504, b1505 


vdpS vHpT 


APnnn9A7 

HCUUUi'f / 


41,445 


EcXA023b 


f!76, f382 


M504, bJ505 


vdeS vdeT 


AFnnri747 


42,446 


EcXA024 


yffM yg/rV 


b3Q82 


voiM voiN 


AFnnrrcon 


43,447 


EcXA025 


02383 


bJ878 


yeeJ 


AFnnn?RQ 


44,448 


EcXA026 


o61 


Unpredicted 


Unpre-dicted 


AE0D0138 


45,449 


EcXA027a 


yohH 


yohH 


yohH 




46,450 


EcXA027b 


yohH 


yohH 


yohH 


MLUUUOU J 


47,451 


EcXA027c 


yohH 


yohH 


yohH 








yohl 


yohl 


yohl 




48,452 


EcXA027d 


yohH 


yohH 


yohH 


Acnnnono 


49,453 


ECXA028 


f296 


b2305 


yfcl 


AE000319 


50, 454 


EcXA029 


viiK 


b4391 


Yfl\ 


AE000509 


51,455 


EcXA030 


yi5A 


b3557 


yi5A 


AE000433 


52,456 [ 


EcXA031 


rplE 


B3308 


rplE 


AE000408 


53, 457 


EcXA032a 


ybgD 


ybgD 


ybgD 


AE000175 


54,458 


EcXA032b" 


yhgD 


ybgD 


ybgD 


AE000175 
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SEQID 


Molecule No. 


Gene 


Gbdr 


UOlltf 


rnunr 


NO. 




(NCBI) 


(Blattner) 


\nuuuf 








gttA 


QUA 


oftA 

yiiri 




55, 459 


EcXA033a 


f 477 (as) 


b3Q52 


VwaaL 


AFnnrreQ7 












flpnnnipc 


56, 460 


EcXA033b 


f477 


b3052 


wVOOC 


APnnniQ7 


57, 461 


EcXA034a 


cspA 


b355B 


LSfJH 


AcnnnA'3'3 


58, 462 


EcXA034b 


cspA 


UtjUUO 


cspA 


AtUUU4d3 


59, 463 


EcXA035 


vhiU 


Mil 
yn/u 


it hill 

ynju 


AEUUU431 


60, 464 


EcXA036 




b3I0f 


YQlE 








o99 


U*J IUU, 


t/m'tY 




61,465 


EcXA037 


yucn 


0ID03 


t/rtnU 

yueri 


AE00D251 


62, 466 


EcXA038 


s/sB 


U 1 J3%} 


s/sB 


AtUQQ233 


63, 467 


EcXA039 


vbbD 

yuuu 




yuDU 


a con pi 1 ce 


64, 468 


EcXA040 


InsB 6 


UOHH*J 


inso o 


Acnnn/ion 
AtUUU4ZD 


65, 469 


EcXA041 


f234 


b1138 

u § i *ju 


ymfE 


Acnnnoi a 
AtUUUl 14 


66, 470 


EcXA042a 


rplY 


mIY 


mIY 


Acnnnono 
AtUUUJUo 


67, 471 


EcXA042b 


rplY 




rnlY 


AtUUUJUo 


68, 472 


EcXA043 


vboB 


vbnR 
yugo 


yugo 


MtUUU 1 /D 






cydA 


ruHA 
C yUri 


cydA 




69,473 


EcXA044 


purB 


him 


ntirR 


AtUUUZl J 


70, 474 


EcXA045 # * 


csrA 




csrA 


AtUUUJOJ 






s$rV 


AO/ ¥ 


earl/ 




71,475 


EcXA046 ## 


fimE fi/nA 


b4313 


ft/nF ftmA 
tin IE, llulH 


Acnnncno 


72, 476 


EcXAQ47 ## 


f96, csdB 


fS6 csdB 


rcn/7 utifQ 


AtUUUZDZ 


73,477 


EcXA048 


yefE 


vefF 


vpff 


Acuuuzy4 


74, 478 


EcXA049 


yaiC 




ya/u 


AcnnniA^ 

MCUUU I*f0 


75, 479 


EcXA050 


o467 o222 




x/aill if oil/ 

yaiUf ysiw 


Acnnni/i/! 

MCUUU l*Hf 


76, 480 


EcXA051a 


wlB rnlW 

ifllU, 9 fit WW 


rnIR rnlW 

Iff ID, If/Iff 


mIR mlW 


AcUUUhUo 


77,481 


EcXA051b 


mm 


rplW 


rplW 


AE000408 


78, 482 


EcXA052 


infC 


infC 


infC 


AE000267 












AE0Q0266 


79, 483 


EcXA053 


gor 


gor 


gor 


AE000426 


80,484 


EcXA054 


rplF 


rplF 


rplF 


AE000408 


81,485 


EcXA055 


rrIG 


rrIG 


rrIG 


AE000345 



EXAMPLE 4 

Identification of Genes an d their Corresponding Ooerons Affected bv Antisense Inhibition 
The sequencing of the entire E. coli genome is described in Blattner et al., Science 277:1453-1474(1997) the 
entirety of which is hereby incorporated by reference and the sequence of the genome is listed in GenBank Accession 
No.U00096, the disclosure of which is incorporated herein by reference in its entirety. The operons to which the 
proliferation-inhibiting nucleic acids correspond were identified using RegulonDB and information in the literature. The 
coordinates of the boundaries of these operons on the E. coli genome are listed in Table III. Table II lists the molecule 
numbers of the inserts containing the growth inhibiting nucleic acid fragments, the genes in the operons corresponding to 
the inserts, the SEQ ID NOs of the genes containing the inserts, the SEQ ID NOs of the proteins encoded by the genes, the 
start and stop points of the genes on the E. coli genome, the orientation of the genes on the genome, whether the operons 
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are predicted or documented, and the predicted functions of the genes. The identified operons, their putative functions, 
and whether or not the genes are presently thought to be required for proliferation are discussed below. 

Functions for the identified genes were determined by using either Blattner functional class designations or by 
comparing identified sequence with known sequences in various databases. A variety of biological functions were noted 
for the genes to which the clones of the present invention correspond. The functions for the genes of interest i 
Table II. 

The proteins that are listed in Table II are involved in a wide range of biological functions. 
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Predicted functional class 
of encoded proteins 


Hypothetical outer 
membrane orotein 


Resistance to phage C1; 
periplastic protein perhaps 
anchored to inner membrane 


Secretion 


Protease 


Translation (Elongation 
factor Tu) 


Translation (elongation 
factor efo) 


Translation 


Translation (rRNA) 


Translation 


Translation 


Translation 


Carbon compound 
catabofism 


Probable adenine deaminase 


Blattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Transport and binding 
proteins 


Unknown 


Translation, post- 
translational modification 


Translation, post- 
radiational modification 


Translation, post- 
radiational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Carbon compound 
catabolism 


Hypothetical ORF, 1 
unclassified, unknown 


Predicted (P) 
Or 

Documented 
ID] 
Operon 


a. 




Q- 


a. 


Q 










o 








Right 
Coordinate 


3607513 


3608143 


2703329 


1017522 


3468966 


3471151 


3471718 


2729178 


3471815 


4178071 


4178503 


2414911 


3843357 


Left Coordinate 


3606848 


3607532 


2702355 


1015762 


3467782 


3469037 


3471179 


2727636 


3471815 


4177574 


4178138 


2412767 


3841591 


Genes On 
Operon 


i 


i 


lepB 


1 


tufA 


i 


i 




I 


% 






1 


Mole. No. 


EcXAOOl 
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WO 00/44906 



PCT/US00/02200 



Predicted functional class 
of encoded proteins 








Probable integral membrane 
protein Phthalate permease 
family 




Putative ABC transoorter 










Regulator of cell division 


Anaerobic respiration 
(formate dehvdro-oenase) 


3 






No homologues, no motifs 


Ubiquinone synthesis 


Blattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Putative enzymes 


Hypothetical ORF, 
unclassified, unknown 


Carbon compound 
catabolism 


Putative transport proteins 


Putative transport proteins 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Cell processes (incl. 
Adaptation, protection) 


Energy metabolism 


Energy metabolism 


Energy metabolism 




Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 1 
unclassified, unknown 


Predicted (P) 
Or 

Documented 

(D) 

Dnprnn 

UUCI Ull 






















Cu 


o 








Q_ 




Right 
Coordinate 


3269492 


! 3270407 


3271198 


3272548 


1601049 


| 1602071 


1603063 


1604097 


1604999 


1605313 


106456 


1548472 


1549369| 


1550015 




2697943 


3050337 


Left Coordinate 


3268266 


I 3269508 


3270428 


3271214 


1599514 


| 1601043 


1602071 


1603075 


1604124 


1605023 


105305 


1545425 


15484851 


15493621 




2697683 


3049135 
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WO 00/44906 



PCT/US00/02200 



Predicted functional class 
of encoded proteins 










Putative membrane protein 


Homologue to transgly- 
cosylase associated protein 


No homologues 




Homologues in multiple 


Transport (multiple 


U 
J 

3 

2 

> 

> 
5 

3 

3 
> 

! 




Veak homology to pilin 
irecursor from H. Inf. 


Biattner functional class 
of encoded proteins 


Biosynthesis of cofactors, 
prosthetic groups and 
carriers 


Translation, post- 
translational modification 


Hypothetical 0RF f 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Lell structure 
Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
'unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Structural proteins 


Putative transport proteins 


Hypothetical ORF, 
unclassified, unknown 


hypothetical ORF, 
mclassified, unknown 


'utative transport proteins 
"ell structure \ 


Predicted (P) 
Or 

Documented 
(D) 
Operon 








Q_ 






Q- 




Q- 








Q, 


' Right 
Coordinate 


1 3051538 


3052860 


3053470 


2466237 


2468482 


1877279 


1877609 


1877972 


3066100 


2153285 


2156407 


2159485 


2186434 


Left Coordinate 


3050360 


3051535 


3052886 


2465875 


2467151 


1877031 


1877427 


1877613 


3065360 


2151891 


2153285 


2156408 


2185400 
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WO 00/44906 



PCT/US00/02200 





Predicted functional class 
of encoded proteins 










Homologues in H. Inf. and S. 
Pombe., no motifs, 
transmem brane region 
oresent 


















imMike 


Blattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassifiei unknown 


Putative chaperones 


Cell structure 




Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassifiei unknown 


Hypothetical ORF, 
unclassifiei unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassifiei unknown 


Hypothetical ORF, 
unclassifiei unknown 


Hypothetical ORF, 
unclassifiei unknown 


Transcription, RNA 
processing and degradation 


ffypothetical ORF, 
jnclassifiei unknown 


hypothetical ORF, j 
mclassified, unknown 


Predicted (P) 
Or 

Documented 
ID) 
Operon 










Q_ 
















Q_ 




Right 
Coordinate 


2188930 


2189665 


2190242 


239084 


1704372 


1704950 


1707165 


1708224 


1708848 


1709547 


1710182 


1586320 


1586863 




Left Coordinate 


2186450 


! 2188946 


i 2189700 




238746 


1703791 


1704372 J 


1704943 


1707166 


1708228 


1708852 


1709547 


1585817 


1586333 
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WO 00/44906 



PCT7USOO/02200 



Predicted functional class 
of encoded proteins 


fimdiike 


Weak homology to long 
chain fatty acid coa ligase in 
Archaeolobus 


Homoiogues in various 
bacteria 


Strong similarity to 
numerous attaching amd 
effacing proteins and 
invasins 


nifm like 




Xylose binding protein like 




CU 

cr 
J5 
( j2 
!H 

*. 

to g 
a 

.E 3 


r v 

Similar to ABC transporter 


IS 150 orfA 










Blattner functional class 
of encoded proteins 


Structural proteins 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 




Putative transport proteins 


Hypothetical ORF, 
unclassified, unknown 


Putative regulatory protein 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Phage, transposon, or 
plasmid 


Translation, post 
translational modification 


Putative transport oroteins 


r r _ 

translation, post- 
[ranslationai modification 


Predicted (P) 
Or 

Documented 
ID) 
Operon 




cu 




cu 


| unpredicted 






CU 


cu 


cu 


cu 
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Right 
Coordinate 


1588025 


3231785 


3232096 


2050036 


! 331184 


2226539 


2226859 


2228405 | 


2421559 


4628091 


3718830 


3719678 


3440371 


3441734| 


3442176 


Left Coordinate 


! 1586877 


3231369 


3231782 


2042885 


! 331001 


2225343 


2226569 


2227458 


2420669 


4626424 


3718309 


3718827 


3440255 


3440403) 


3441742 
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WO 00/44906 



PCT/USOO/02200 



Predicted functional class 
of encoded proteins 








Translation 






Translation 






Hypothetical fimbrial protein 
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RNA chaperonin 




Blattner functional class 
of encoded proteins 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post* 
translational modification 


Translation, post- 
translational modification 


Translation, post* 
translational modification 


Translation, post* 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Cell processes (incl. 
Adaptation, orotection) 


Energy metabolism 


Putative enzymes 


Translation, post* 
translational modification 


Hypothetical 0RF, 
unclassified, unknown 


Dell processes (incl. 
Adaptation, orotection) 


Franslation, post- 
ranslational modification 


Predicted (P) 
Or 

Documented 
ID) 
Operon 




















cu 




cu 






cu 




Right 
Coordinate 


3442359 


3442866 


3443234 


3443777 


3444182 


3444521 


3445075 


3445404 


3445786 


| 752018 


753691 1 


3194394 


3197282 


3198606 


3717890 


3695658 


Left Coordinate 


3442180 


3442363 


3442881 


3443244 


3443790 


3444216 


3444536 


3445090 


3445415 


751452 


7524081 


3192961 


3194442 


3197305 


371767B 


3694087 
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WO 00/44906 



PCIYUSOO/02200 



Predicted functional class 
of encoded proteins 




Regions similar to dehydro- 
genases, nucleases etc. 










Homologues in many 
bacteria, blocks; secretion/ 
ATP synthasejftsz 


Similar to carboxy-kinase, 
oxidase, svmnorters 


Super-infection exclusion 
factor B-like 








Rhs like element 


ATP synthase, desaturase 


Biattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Similar to mukb from H. Inf. 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Phage, transposon, or 
plasmid 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Predicted (P) 
Or 

Documented 
(0) 

flnprnn 

up CI Ull 






a. 








a. 




cu 




Cu 








Right 
Coordinate 


3695846 


3697522 


3246977 


3247320 


3247727 


3248016 


3248594 


1621874 


1417183 


1417368 


526765 


527173 


527883 


528124 


Left Coordinate 


3695658 


3695843 


3246594 


3247015 


3247323 


3247717 


1 3248112 


1620984 


1416572 


1417192 


522485 


526805 


527173 


527864 


Genes On 
Operon 


yhjT 






i 








i 


QQ 


I 


s 


CO 

* 


$ 


i 


d 

as 
"5 

s 






EcXA036 










CO 

o 

<c 

X 
u 

UJ 


EcXA038 




cn 

CO 

o 
«t 

X 
u 

UJ 








Gene Prod. 
Seq ID No. 


O 

CO 
CO 


CO 
CO 


CM 
CO 
CO 


CO 
CO 
CO 


CO 
CO 


LO 
CO 
CO 


CO 
CO 
CO 


r-* 

CO 

CO 


CO 
CO 
CO 


cn 

CO 
CO 


O 
CO 


CO 


CM 
CO 


CO 
CO 


GeneSeq 
ID 
No. 

- 


CD 




CM 


CO 

Px 


*r 


Ln 


CO 


r>. 


CO 


CD 


o 




CM 


CO 



-33- 



WO 00/44906 



PCTAJSOO/02200 



Predicted functional class 
of encoded proteins 












No assigned role 


No assigned role 


Translation 




Unknown 


Cytochrome 0 oxidase 




Purine biosynthesis 


Carbon storage regulator 
(mRNA decay factor) 


Translation ftRNA) 




Fimbrae 


Regulator of inversion 


Blattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Phage, transposon, or 
plasmid 


Phage, transposon, or 
plasmid 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Translation, post- 
radiational modification 


Translation, post* 
translational modification 


Carbon compound 
catabolism 


Energy metabolism 


Energy metabolism 


Nucleotide biosynthesis and 
metabolism 


Regulatory function 


Translation, post- 
translational modification 


Cell structure 1 


Cell structure 


Cell structure 


Predicted (P) 
Or 

Documented 
(0) 
Operon 




cu 








Q- 




CU 
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□ 








Unpredicted 


a 






Right 
Coordinate 


528354 


351389 


3581811 


3581085 


3580672 


1196755 


1197460 


2280821, 


767183 


769834 


772249 


773404 | 


1191209 


2817168 


2816667 


45391271 


4540201 | 


4541231 


Left Coordinate 


528163 


351114 


351308 


! 3580669 


3579494 


0609611 


1196756 


2280537 


765207 


767201 


770678 


772265 


1189839 


2816983 


2816575 


4538525| 


45396051 


4540683| 
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WO 00744906 



PCTAJS00/02200 



Predicted functional class 
of encoded proteins 




















Lysis protein 
















UDP galacto-pyranase 


Jnknown 


'utative auto-transporter 


Blattner functional class 
of encoded proteins 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Cell processes (incl. 
Adaptation, protection) 


Phage, transposon, or 
plasmid 


Putative enzvmes 1 


Hypothetical ORF, 
unclassified, unknown 


Putative enzvmes 


Hypothetical ORF, 
unclassified, unknown 


Cell structure 


Hypothetical ORF, I 
unclassified, unknown i 


Hypothetical ORF, 1 
unclassified, unknown 


Putative enzymes 1 


Predicted (P) 
Or 

Documented 
(D) 
Oneron 

VUWI If II 














Cu 








Cu 


Cu 
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Right 
Coordinate 


I 4541835 


4542597 


4545301 


4545841 


4546357 


I 4547279 


1638684 


1638081 


1638389 


1638684 


1639578 


2100933 


210141l| 


2102531 


21031061 


2104079 


2105248| 


2106351 


404042 


393642 


Left Coordinate 


I 4541188 


4541872 


! 4542665 


I 4545311 


| 4545854 


1 4546377 


1637054 


1637548 


1638078 


1638394 


1639363 


2099917 


21009381 


2101413 


2102516 
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2105248 


402927 


392239 
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WO 00/44906 



PCT/USOO/02200 



Predicted functional class 
of encoded proteins 


Hypothetical outer 
membrane orotein 














'Translation 


Translation 












Translation 


Biattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
radiational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
radiational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
radiational modification 


Predicted (P) 
Or 

Documented 
(D) 

Onprnn 

VJJGI (III 




a 
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Right 
Coordinate 


394353 


3446205 


3446396 


3446806 


3447520 


3447870 


3448163 


3449001 


3449321 


3449923 


3450563 


3450907 


1797773 


1798023 


1798662 


Left Coordinate 


393685 


3445951 


3446205 


3446396 


3446819 


3447538 


3447885 


3448180 


3449019 


3449318 
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Several of the expression vectors contain fragments that correspond to genes of unknown function or if the function 
is known, it is not known whether the gene is essential. For example, EcXAOOl, 003, 007, 008, 013, 015, 016, 017. 018, 
019, 020, 021, 022, 023. 024, 025, 026, 027, 028, 029, 030, 032. 033. 034, 035, 036, 037, 038, 039, 040, 041. 047. 
048. 049 and 050 are all exogenous nucleic acid sequences that correspond to £ coS proteins that have no known function or 
where the function has not been shown to be essential or nonessential. 

The present invention reports a number of novel £ coffgenes and operons that are required for proliferation. From the 
list clone sequences identified here, each was identified to be a portion of a gene in an operon required for the proliferation of £ 
coli. Cloned sequences corresponding to genes already known to be required for proliferation in £ coli include EcXA002, 004. 
005. 010. 012, 014, 031, 02. 043, 045, 051, 052, 054, and 055. The remaining identified sequences correspond to £ coli 
genes previously undesignated as required for proliferation in the art. 

An interesting observation of the present invention is that there are also several sequence fragments that correspond 
to £ coli genes that are not thought to be required for £ ^proliferation. Nevertheless, under the conditions described above, 
the antisense expression of these gene fragments causes a reduction in cell growth. This result implies that the genes 
corresponding to the identified sequences are actually required for proliferation. Molecule Nos. corresponding to these genes 
areEcXA006,044,046,and053. 

Following identification of the sequences of interest, these sequences were localized into operons. Since bacterial 
genes are expressed in a polycistronic manner, the antisense inhibition of a single gene in an operon might effect the expression 
of all the other genes on the operon or the genes down stream from the single gene identified. In order to determine which of 
the gene products in an operon are required for proliferation, each of the genes contained within an operon may be analyzed for 
their effect on viability as described below. 
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TABLE 111 



Qperon Boundarigs 



Mole. No. 


Left 
Coordinate 


Right 
Coordinate 


EcXAQOI 


3606848 


1 3608143 


EcXA002 


2702355 


2703329 


EcXA003 


1015762 


1017522 


EcXA004 


3467782 


3472189 


EcXA005 


4177574 


4178503 


EcXA006 


2412767 


2414911 


EcXA007 


3841591 


3843357 


EcXA008 


3268266 


3272548 


EcXAOOg 


1599514 


1605313 


EcXAOlO 


1647406 


1647458 


EcXA011 


1545425 


1550015 


EcXA012 


3467782 


3472189 


EcXA013 


2697683 


2697943 


EcXA014 


3049135 


3053470 


EcXAOl 5 


2465875 


2468482 


EcXA016 


1877031 


1877972 


EcXA017 


3065360 


3066100 


EcXAOl 8 


2151891 


2160901 


EcXA019 


2185400 


2190242 


EcXA020 


1877031 


1877972 


EcXA021 


238746 


239084 


EcXA022 


1703791 


1710182 


EcXA023 


1585817 


1588025 


EcXA024 


3231369 


3232096 


EcXA025 


2042885 


2050036 


EcXA026 


331001 


331184 


EcXA027c 


2225343 


2228405 


EcXA028 


2420669 


2421559 


EcXA029 


4626424 


4628091 


EcXA030 


3718309 


3719678 


EcXA031 


3440255 


3445786 


EcXA032b 


751452 


753691 


EcXA033 


3192961 


3198606 
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Mole. No. 
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EXAMPLE 5 

Identification of Individual Genes within an Operon Renuired for Proliferatinn 
The following example illustrates a method for determining which gene in an operon is required for proliferation. 
The clone insert corresponding to Molecule No. EcXA004 possesses nucleic acid sequence homology to the £ coB genes 
rspG and rspL This molecule corresponds to an operon containing two additional genes fusA and tufA. The rpsL gene is 
the first gene in the operon. To determine which gene or genes in this operon are required for proliferation, each gene is 
selectively inactivated using homologous recombination. GensrpsL is the first gene to be inactivated. 

Deletion inactivation of a chromosomal copy of a gene in f. coff can be accomplished by integrative gene 
replacement. The principle of this method (Hamilton, C. M., et al 1989. J. Bacterial. 171: 4617-4622) is to construct a 
mutant allele of the targeted gene, introduce that allele into the chromosome using a conditional suicide vector, and then 
force the removal of the native wild type allele and vector sequences. This will replace the native gene with a desired 
mutation(s) but leave promoters, operators, etc. intact. Essentiality of a gene is determined either by deduction from 
genetic analysis or by conditional expression of a wild type copy of the targeted gene (trans complementation). 

The first step is to generate a mutant rpsL allele using PCR amplification. Two sets of PCR primers are chosen 
to produce a copy of rpsL with a large central deletion to inactivate the gene. In order to eliminate polar effects, it is 
desirable to construct a mutant allele comprising an in-frame deletion of most or all of the coding region of the^wZ gene. 
Each set of PCR primers is chosen such that a region flanking the gene to be amplified is sufficiently long to allow 
recombination (typically at least 500 nucleotides on each side of the deletion). The targeted deletion or mutation will be 
contained within this fragment. To facilitate cloning of the PCR product, the PCR primers may also contain restriction 
endonuclease sites found in the cloning region of a conditional knockout vector such as pK03 (Link, et al 1997 J. 
Bacterial. 179 (20): 6228-6237). Suitable sites include Notl, Sail, BamHI and Smal. The rpsL gene fragments are 
produced using standard PCR conditions including, but not limited to, those outlined in the manufacturers directions for the 
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Hot Start Taq PCR kit (Qiagen, Inc., Valencia, CA). The PCR reactions will produce two fragments that can be fused 
together. Alternatively, crossover PCR can be used to generate e desired deletion in one step (Ho, S. N., et al 1989. Gene 
77: 51 59, Horton, R. M., et al 1989. Gene 77: 61 68). The mutant allele thus produced is called a "null' allele because it 
cannot produce a functional gene product. 

The mutant allele obtained from PCR amplification is cloned into the multiple cloning site of pK03. Directional 
cloning of the rpsl null allele is not necessary. The pK03 vector has a temperature-sensitive origin of replication derived 
from pSCIOI. Therefore, clones are propagated at the permissive temperature of 30°C. The vector also contains two 
selectable marker genes: one that confers resistance to chloramphenicol and another, the Bacillus subtilis sacB gene, that 
allows for counter-selection on sucrose containing growth medium. Clones that contain vector DNA with the null allele 
inserted are confirmed by restriction endonuclease analysis and DNA sequence analysis of isolated piasmid DNA. The 
plasmid containing the rpsl null allele insert is known as a knockout piasmid. 

Once the knockout piasmid has been constructed and its sequence verified, it is transformed into a Rec* £ coli 
host cell. Transformation can be by any standard method such as electroporation. In some fraction of the transformed 
cells, plasmids will integrate into the £ coli chromosome by homologous recombination between the rpsl null allele in the 
piasmid and the rpsl gene in the chromosome. Transformant colonies in which such an event has occurred are readily 
selected by growth at the non-permissive temperature of 43°C and in the presence of choramphenicol. At this 
temperature, the piasmid will not replicate as an episome and will be lost from cells as they grow and divide. These cells 
are no longer resistant to chloramphenicol and will not grow when it is present. However, cells in which the knockout 
piasmid has integrated into the£ coli chromosome remain resistant to chloramphenicol and propagate. 

Cells containing integrated knock-out plasmids are usually the result of a single crossover event that creates a 
tandem repeat of the mutant and native wild type alleles of rpsl separated by the vector sequences. A consequence of 
this is that rpsl will still be expressed in these cells. In order to determine if the gene is essential for growth, the wild type 
copy must be removed. This is accomplished by selecting for piasmid excision, a process in which homologous 
recombination between the two alleles results in looping out of the piasmid sequences. Cells that have undergone such an 
excision event and have lost piasmid sequences including sacB gene are selected for by addition of sucrose to the medium. 
The sacB gene product converts sucrose to a toxic molecule. Thus counter selection with sucrose ensures that piasmid 
sequences are no longer present in the cell. Loss of piasmid sequences is further confirmed by testing for sensitivity to 
chloramphenicol (loss of the chloramphenicol resistance genel. The latter test is important because occasionally a 
mutation in the sacB gene can occur resulting in a loss of sacB function with no effect on piasmid replication (Link, et. al., 
1997./. Bacteriol. 179 (20): 6228-6237). These artifact clones retain piasmid sequences and are therefore still resistant 
to chloramphenicol. 

In the process of piasmid excision, one of the two rpsl alleles is lost from the chromosome along with the 
piasmid ONA. In general, it is equally likely that the null allele or the wild type allele will be lost. Therefore, if the rpsl 
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gene is not essential half of the clones obtained in this experiment will have the wild type allele on the chromosome and 
half will have the null allele. However, if the rpsl gene is essential, cells containing the null allele will not be obtained as a 
single copy of the null allele would be lethal. 

To determine the essentiality of rpsL, a statistically significant number of the resulting clones, at least 20, are 
analyzed by PCR amplification of the rpsl gene. Since the null allele is missing a significant portion of the rpsL gene, its 
PCR product is significantly shorter than that of the wild type gene and the two are readily distinguished by gel 
eiectrophoretic analysis. The PCR products may also be subjected to sequence determination for further confirmation by 
methods well known to those in the art. 

The above experiment is generally adequate for determining the essentiality of a gene such as rpsL However, it 
may be necessary or desirable to more directly confirm the essentiality of the gene. There are several methods by which 
this can be accomplished. In general, these involve three steps: 1) construction of an episome containing a wild type 
allele, 2) isolation of clones containing a single chromosomal copy of the mutant null allele as described above but in the 
presence of the episomal wild type allele, and then 3) determining if the cells survive when the expression of the episomal 
allele is shut off. In this case, the trans copy of wild type rpsL is made by PCR cloning of the entire coding region of rpsL 
and inserting it in the sense orientation downstream of an inducible promoter such as the £ caff lac promoter. 
Transcription of this allele of rpsl will be induced in the presence of IPTG which inactivates the lac repressor. Under IPTG 
induction rpsL protein will be expressed as long as the recombinant gene also possesses a ribosomal binding site, also 
known as a "Shine-Dalgarno Sequence". The trans copy of rpsl is cloned on a plasmid that is compatible with pSCIOI. 
Compatible vectors include p15A, pBR322, and the pUC plasmids, among others. Replication of the compatible plasmid 
will not be temperature-sensitive. The entire process of integrating the null allele of rpsL and subsequent plasmid excision 
is carried out in the presence of IPTG to ensure the expression of functional rpsl protein is maintained throughout. After 
the null rpsl allele is confirmed as integrated on the chromosome in place of the wild type rpsl allele, then IPTG is 
withdrawn and expression of functional rpsl protein shut off. If the rpsl gene is essential, cells will cease to proliferate 
under these conditions. However, if the rpsl gene is not essential, cells will continue to proliferate under these conditions. 
In this experiment, essentiality is determined by conditional expression of a wild type copy of the gene rather than inability 
to obtain the intended chromosomal disruption. 

An advantage of this method over some other gene disruption techniques is that the targeted gene can be deleted 
or mutated without the introduction of large segments of foreign DNA. Therefore, polar effects on downstream genes are 
eliminated or minimized. There are methods described to introduce inducible promoters upstream of potential essential 
bacterial genes. However in such cases, polarity from multiple transcription start points can be a problem. One way of 
preventing this is to insert a gene disruption cassette that contains strong transcriptional terminators upstream of the 
integrated inducible promoter (Zhang, Y, and Cronan, J. E. 1996 J. Bacteriol. 178 (12): 3614-3620). The described 
techniques will all be familiar to one of ordinary skill in the art. 
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Following the analysis of the rpsL gene, the other genes of the operon are investigated to determine if they are 
required for proliferation. 

EXAMPLE 6 

Expression of the Proteins Encoded hv Ge nes Identified as Required for £ coli Prnlifwatinn 
The following is provided as one exemplary method to express the proliferation-required proteins encoded by the 
identified sequences described above. First, the initiation and termination codons for the gene are identified. If desired, 
methods for improving translation or expression of the protein are well known in the art. For example, if the nucleic acid 
encoding the polypeptide to be expressed lacks a methionine codon to serve as the initiation site, a strong Shine-Delgamo 
sequence, or a stop codon, these sequences can be added. Similarly, if the identified nucleic acid sequence lacks a transcription 
termination signal, this sequence can be added to the construct by. for example, splicing out such a sequence from an 
appropriate donor sequence. In addition, the coding sequence may be operably linked to a strong promoter or an inducible 
promoter if desired. The identified nucleic acid sequence or portion thereof encoding the polypeptide to be expressed is obtained 
by PCR from the bacterial expression vector or genome using oligonucleotide primers complementary to the identified nucleic 
. acid sequence or portion thereof and containing restriction endonuclease sequences for Nco\ incorporated into the 5' primer and 
BgM at the 5' end of the corresponding 3'-primer, taking care to ensure that the identified nucleic acid sequence is positioned in 
frame with the termination signal. The purified fragment obtained from the resulting PCR reaction is digested with Nco\ and 
BgM, purified and iigated to an expression vector. 

The Iigated product is transformed into DH5a or some other £ coli strain suitable for the over expression of potential 
proteins. Transformation protocols are well known in the art. For example, transformation protocols are described in: Current 
Protocols in Molecular Biology, Vol. 1, Unit 1.8, [Ausubel, et at.. Eds.) John Wiley & Sons, Inc. (1997). Positive 
transformants are selected after growing the transformed cells on plates containing 50-100 //g/ml Ampictflin (Sigma, St Louis, 
Missouri). In one embodiment, the expressed protein is held in the cytoplasm of the host organism. In an alternate embodiment, 
the expressed protein is released into the culture medium. In still another alternative, the expressed protein can be sequestered 
in the periplasmic space and liberated therefrom using any one of a number of cell lysis techniques known in the art. For 
example, the osmotic shock cell lysis method described in Chapter 16 of Current Protocols in Molecular Biology. VoL 2, 
(Ausubel, et al, Eds.) John Wiley & Sons, Inc. (1997). Each of these procedures can be used to express a proliferation-required 
protein. 

Expressed proteins, whether in the culture medium or liberated from the periplasmic space or the cytoplasm, are then 
purified or enriched from the supernatant using conventional techniques such as ammonium sulfate precipitation, standard 
chromatography, immunoprecipitation, immunochromatography, size exclusion chromatography, ion exchange chromatography, 
and HPLC. Alternatively,, the secreted protein can be in a sufficiently enriched or pure state in the supernatant or growth media 
of the host to permit it to be used for its intended purpose without further enrichment. The purity of the protein product 
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obtained can be assessed using techniques such as Coomassie or silver staining or using antibodies against the control protein. 
Coomassie and silver staining techniques are familiar to those skilled in the art. 

Antibodies capable of specifically recognizing the protein of interest can be generated using synthetic peptides using 
methods well known in the art. See, Antibodies: A Laboratory Manual, (Harlow and Lane. Eds.) Cold Spring Harbor 
Laboratory (1988). For example, 15-mer peptides having a sequence encoded by the appropriate identified gene sequence of 
interest or portion thereof can be chemically synthesized. The synthetic peptides are injected into mice to generate antibodies 
to the polypeptide encoded by the identified nucleic acid sequence of interest or portion thereof. Alternatively, samples of the 
protein expressed from the expression vectors discussed above can be purified and subjected to amino acid sequencing analysis 
to confirm the identity of the recombinant^ expressed protein and subsequently used to raise antibodies. An Example 
describing in detail the generation of monoclonal and polyclonal antibodies appears in Example 7. 

The protein encoded by the identified nucleic acid sequence of interest or portion thereof can be purified using 
standard immunochromatography techniques. In such procedures, a solution containing the secreted protein, such as the 
culture medium or a cell extract, is applied to a column having antibodies against the secreted protein attached to the 
chromatography matrix. The secreted protein is allowed to bind the immunochromatography column. Thereafter, the column is 
washed to remove non-specifically bound proteins. The specifically bound secreted protein is then released from the column 
and recovered using standard techniques. These procedures are well known in the art. 

In an alternative protein purification scheme, the identified nucleic acid sequence of interest or portion thereof can be 
incorporated into expression vectors designed for use in purification schemes employing chimeric polypeptides. In such 
strategies the coding sequence of the identified nucleic acid sequence of interest or portion thereof is inserted in-frame with the 
gene encoding the other half of the chimera. The other half of the chimera can be maltose binding protein (MBP) or a nickel 
binding polypeptide encoding sequence. A chromatography matrix having antibody to MBP or nickel attached thereto is then 
used to purify the chimeric protein. Protease cleavage sites can be engineered between the MBP gene or the nickel binding 
polypeptide and the identified expected gene of interest, or portion thereof. Thus, the two polypeptides of the chimera can be 
separated from one another by protease digestion. 

One useful expression vector for generating maltose binding protein fusion proteins is pMAL (New England Biolabs), 
which encodes the /ra/£gene. In the pMal protein fusion system, the cloned gene is inserted into a pMal vector downstream 
from the malE gene. This results in the expression of an MBP-fusion protein. The fusion protein is purified by affinity 
chromatography. These techniques as described are well known to those skilled in the art of molecular biology. 

EXAMPLE 7 

Production of an Antibody to an isolated £ coli Protein 
Substantially pure protein or polypeptide is isolated from the transformed cells as described in Example 6. The 
concentration of protein in the final preparation is adjusted, for example, by concentration on a 10,000 molecular weight cut off 
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AMICON filter device (MOIipore, Bedford, MA), to the level of a few micrograms*!!!. Monoclonal or polyclonal antibody to the 
protein can then be prepared as follows: 

Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can be prepared from 
murine hybridomas according to the classical method of Kohler, G. and Milstein, C, Nature 256:495 (1975) or any of the well- 
known derivative methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein or 
peptides derived therefrom over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the 
spleen isolated. The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess unfused 
cells destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully fused cells 
are diluted and aliquots of the dilution placed in wells of a microliter plate where growth of the culture is continued Antibody, 
producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures, such 
as ELISA, as described by Engvall, E., "Enzyme immunoassay ELISA and EMIT," Meth. Enzymol. 70:419 (1980), and 
derivative methods thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use. 
Detailed procedures for monoclonal antibody production are described in Davis, L et aL Basic Methods in Molecular Biology 
Elsevier, New York. Section 21-2. 

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein or a peptide can be prepared 
by immunizing suitable animals with the expressed protein or peptides derived therefrom described above, which can be 
unmodified or modified to enhance immunogenicity. Effective polyclonal antibody production is affected by many factors 
related both to the antigen and the host species. For example, small molecules tend to be less immunogenic than larger 
molecules and can require the use of carriers and adjuvant. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered 
at multiple intradermal sites appears to be most reliable. An effective immunization protocol for rabbits can be found in 
Vaitukaitis, J. et aL J. Clin. Endocrinol. Metab. 33:988*991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as 
determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the antigen, 
begins to fall. See, for example, Ouchterlony, 0. et aL, Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) 
Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 M). 
Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described, for example, by 
Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., 
Washington, D.C.(1980). 

Antibody preparations prepared according to either protocol are useful in quantitative immunoassays which determine ' 
concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitativeiy or qualitatively to 
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identify the presence of antigen in a biological sample. The antibodies can also be used in therapeutic compositions for killing 
bacterial cells expressing the protein. 

EXAMPLE 8 

Screening Chemical I ihrarip? 

A. Protein-Based Assays 

Having isolated and expressed bacterial proteins shown to be required for bacterial proliferation, the present invention 
further contemplates the use of these expressed proteins in assays to screen libraries of compounds for potential drug 
candidates. The generation of chemical libraries is well known in the art. For example combinatorial chemistry can be used to 
generate a library of compounds to be screened in the assays described herein. A combinatorial chemical library is a collection 
of diverse chemical compounds generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" reagents. For example, a linear combinatorial chemical library such as a polypeptide library is formed 
by combining amino acids in every possible combination to yield peptides of a given length. Millions of chemical compounds 
theoretically can be synthesized through such combinatorial mixings of chemical building blocks. For example, one commentator 
observed that the systematic, combinatorial mixing of 100 interchangeable chemical building blocks results in the theoretical 
synthesis of 100 million tetrameric compounds or 10 billion pentameric compounds. (Gallop et al., "Applications of 
Combinatorial Technologies to Drug Discovery, Background and Peptide Combinatorial libraries," Journal of Medicinal 
Chemistry. Vol. 37, No. 9, 1233-1250 (19941. Other chemical libraries known to those in the art may also be used, including 
natural product libraries. 

Once generated, combinatorial libraries can be screened for compounds that possess desirable biological properties. 
For example, compounds which may be useful as drugs or to develop drugs would likely have the ability to bind to the target 
protein identified, expressed and purified as discussed above. Further, if the identified target protein is an enzyme, candidate 
compounds would likely interfere with the enzymatic properties of the target protein. Any enzyme can be a target protein. For 
example, the enzymatic function of a target protein can be to serve as a protease, nuclease, phosphatase, dehydrogenase, 
transporter protein, transcriptional enzyme, and any other type of enzyme known or unknown. Thus, the present invention 
contemplates using the protein products described above to screen combinatorial chemical libraries. 

Those in the art will appreciate that a number of techniques exist for characterizing target proteins in order to identify 
molecules useful for the discovery and development of therapeutics. For example, some techniques involve the generation and 
use of small peptides to probe and analyze target proteins both biochemically and genetically in order to identify and develop 
drug leads. Such techniques include the methods described in PCT publications No. W09935494, W09819162, W09954728, 
the disclosures of which are incorporated herein by reference in their entireties. 

In another example, the target protein is a serine protease and the substrate of the enzyme is known. The present 
example is directed towards the analysis of libraries of compounds to identify compounds that function as inhibitors of the 
target enzyme. First, a library of small molecules is generated using methods of combinatorial library formation well known in 
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the art. U.S. Patent NOs. 5,463,564 and 5,574, 656, to Agrafiotis, et aL, entitled "System and Method of Automatically 
Generating Chemical Compound with Desired Properties/ are two such teachings. Then the library compounds are screened to 
identify library compounds that possess desired structural and functional properties. U.S. Patent No. 5,684,71 1 also discusses 
a method for screening libraries. 

To illustrate the screening process, the combined target and chemical compounds of the library are exposed to and 
permitted to interact with the purified enzyme. A labeled substrate is added to the incubation. The label on the substrate is 
such that a detectable signal is emitted from metabolized substrate molecules. The emission of this signal permits one to 
measure the effect of the combinatorial library compounds on the enzymatic activity of target enzymes. The characteristics of 
each library compound is encoded so that compounds demonstrating activity against the enzyme can be analyzed and features 
common to the various compounds identified can be isolated and combined into future iterations of libraries. 

Once a library of compounds is screened, subsequent libraries are generated using those chemical building blocks that 
possess the features shown in the first round of screen to have activity against the target enzyme. Using this method, 
subsequent iterations of candidate compounds will possess more and more of those structural and functional features required 
to inhibit the function of the target enzyme, until a group of enzyme inhibitors with high specificity for the enzyme can be found 
These compounds can then be further tested for their safety and efficacy as antibiotics for use in mammals. 

It will be readily appreciated that this particular screening methodology is exemplary only. Other methods are 
well known to those skilled in the art. For example, a wide variety of screening techniques are known for a large number 
of naturally-occurring targets when the biochemical function of the target protein is known. 
B. Cell Based Assays 

Current cell-based assays used to identify or to characterize compounds for drug discovery and development 
frequently depend on detecting the ability of a test compound to inhibit the activity of a target molecule located within a 
ceil or located on the surface of a cell. Most often such target molecules are proteins such as enzymes, receptors and the 
like. However, target molecules may also include other molecules such as DNAs, lipids, carbohydrates and RNAs including 
messenger RNAs, ribosomal RNAs, tRNAs and the like. A number of highly sensitive cell-based assay methods are 
available to those of skill in the art to detect binding and interaction of test compounds with specific target molecules. 
However, these methods are generally not highly effective when the test compound binds to or otherwise interacts with 
its target molecule with moderate or low affinity. In addition, the target molecule may not be readily accessible to a test 
compound in solution, such as when the target molecule is located inside the cell or within a cellular compartment such as 
the periplasm of a bacterial cell. Thus, current cell-based assay methods are limited in that they are not effective in 
identifying or characterizing compounds that interact with their targets with moderate to low affinity or compounds that 
interact with targets that are not readily accessible. 

Cell-based assay methods of the present invention have substantial advantages over current cell-based assays 
practiced in the art. These advantages derive from the use of sensitized cells in which the level or activity of a 
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proliferation-required gene product (the target molecule) has been specifically reduced to the point where the presence or 
absence of its function becomes a rate-determining step for cellular proliferation. Bacterial, fungal, plant, or animal cells 
can all be used with the present method. Such sensitized cells become much more sensitive to compounds that are active 
against the affected target molecule. Thus, cell-based assays of the present invention are capable of detecting compounds 
exhibiting low or moderate potency against the target molecule of interest because such compounds are substantially more 
potent on sensitized cells than on non-sensitized cells. The affect may be such that a test compound may be two to 
several times more potent, at least 10 times more potent or even at least 100 times more potent when tested on the 
sensitized cells as compared to the non-sensitized cells. 

Due in part to the increased appearance of antibiotic resistance in pathogenic microorganisms and to the 
significant side-effects associated with some currently used antibiotics, novel antibiotics acting at new targets are highly 
sought after in the art. Yet, another limitation in the current art related to cell-based assays is the problem of identifying 
hits against the same kinds of target molecules in the same limited set of biological pathways over and over again. This 
may occur when compounds acting at such new targets are discarded, ignored or fail to be detected because compounds 
acting at the "old" targets are encountered more frequently and are more potent than compounds acting at the new 
targets. As a result, the majority of antibiotics in use currently interact with a relatively small number of target molecules 
within an even more limited set of biological pathways. 

The use of sensitized cells of the current invention provides a solution to the above problem in two ways. First, 
desired compounds acting at a target of interest, whether a new target or a previously known but poorly exploited target, 
can now be detected above the "noise" of compounds acting at the "old" targets due to the specific and substantial 
increase in potency of such desired compounds when tested on the sensitized cells of the current invention. Second, the 
methods used to sensitize cells to compounds acting at a target of interest may also sensitize these cells to compounds 
acting at other target molecules within the same biological pathway. For example, expression of an antisense molecule 
to a gene encoding a ribosomai protein is expected to sensitize the cell to compounds acting at that ribosomai protein and 
may also sensitize the ceils to compounds acting at any of the ribosomai components (proteins or rRNA) or even to 
compounds acting at any target which is part of the protein synthesis pathway. Thus an important advantage of the 
present invention is the ability to reveal new targets and pathways that were previously not readily accessible to drug 
discovery methods. 

Sensitized cells of the present invention are prepared by reducing the activity or level of a target molecule. The 
target molecule may be a gene product, such as an RNA or polypeptide produced from the proliferation-required nucleic 
acids described herein. Alternatively, the target may be a gene product such as an RNA or polypeptide which is produced 
form a sequence within the same operon as the proliferation-required nucleic acids described herein, in addition, the target 
may be an RNA or polypeptide in the same biological pathway as the proliferation-required nucleic acids described herein. 
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Such biological pathways include, but are not limited to, enzymatic, biochemical and metabolic pathways as well as 
pathways involved in the production of cellular structures such the ceil wall. 

Current methods employed in the arts of medicinal and combinatorial chemistries are able to make use of 
structure-activity relationship information derived from testing compounds in various biological assays including direct 
binding assays and cell-based assays. Occasionally compounds are directly identified in such assays that are sufficiently 
potent to be developed as drugs. More often, initial hit compounds exhibit moderate or low potency. Once a hit compound 
is identified with low or moderate potency, directed libraries of compounds are synthesized and tested in order to identify 
more potent leads. Generally these directed libraries are combinatorial chemical libraries consisting of compounds with 
structures related to the hit compound but containing systematic variations including additions, subtractions and 
substitutions of various structural features. When tested for activity against the target molecule, structural features are 
identified that either alone or in combination with other features enhance or reduce activity. This information is used to 
design subsequent directed libraries containing compounds with enhanced activity against the target molecule. After one 
or several iterations of this process, compounds with substantially increased activity against the target molecule are 
identified and may be further developed as drugs. This process is facilitated by use of the sensitized ceils of the present 
invention since compounds acting at the selected targets exhibit increased potency in such cell-based assays, thus; more 
compounds can now be characterized providing more useful information than would be obtained otherwise. 

Thus, it is now possible using cell-based assays of the present invention to identify or characterize compounds 
that previously would not have been readily identified or characterized including compounds that act at targets that 
previously were not readily exploited using cell-based assays. The process of evolving potent drug leads from initial hit 
compounds is also substantially improved by the cell-based assays of the present invention because, for the same number 
of test compounds, more structure-function relationship information is likely to be revealed. 

The method of sensitizing a cell entails selecting a suitable gene or operon. A suitable gene or operon is one whose 
expression is required for the proliferation of the ceil to be sensitized. The next step is to introduce into the cells to be 
sensitized, an antisense RNA capable of hybridizing to the suitable gene or operon or to the RNA encoded by the suitable 
gene or operon. Introduction of the antisense RNA can be in the form of an expression vector in which antisense RNA is 
produced under the control of an inducible promoter. The amount of antisense RNA produced is limited by varying the 
inducer concentration to which the cell is exposed and thereby varying the activity of the promoter driving transcription of 
the antisense RNA. Thus, cells are sensitized by exposing them to an inducer concentration that results in a sub-lethal 
level of antisense RNA expression. 

In one embodiment of the cell-based assays, the identified exogenous £ coli nucleotide sequences of the present 
invention are used to inhibit the production of a proliferation-required protein. Expression vectors producing antisense RNA 
against identified genes required for proliferation are used to limit the concentration of a proliferation-required protein 
without severly inhibiting growth. To achieve that goal, a growth inhibition dose curve of inducer is calculated by plotting 
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various doses of inducer against the corresponding growth inhibition caused by the antisense expression. From this curve, 
various percentages of antisense induced growth inhibition, from 1 to 100% can be determined. If the promoter contained 
in the expression vector contains a lac operator the transcription is regulated by lac repressor and expression from the 
promoer is inducible with IPTG. For example, the highest concentration of the inducer IPTG that does not reduce the 
growth rate (0% growth inhibition) can be predicted from the curve. Cellular proliferation can be monitored by growth 
medium turbidity via OD measurements, in another example, the concentration of inducer that reduces growth by 25% 
can be predicted from the curve. In still another example, a concentration of inducer that reduces growth by 50% can be 
calculated. Additional parameters such as colony forming units (cfu) can be used to measure cellular viability. 

Cells to be assayed are exposed to the above-determined concentrations of inducer. The presence of the inducer at 
this sub-lethal concentration reduces the amount of the proliferation required gene product to the lowest amount in the cell 
that will support growth. Cells grown in the presence of this concentration of inducer are therefore specifically more 
sensitive to inhibitors of the proliferation-required protein or RNA of interest or to inhibitors of proteins or RNAs in the same 
biological pathway as the proliferation-required protein or RNA of interest but not to inhibitors of unrelated proteins or 
RNAs. 

Cells pretreated with sub-inhibitory concentrations of inducer and thus containing a reduced amount of 
proliferation-required target gene product are then used to screen for compounds that reduce cell growth. The sub-lethal 
concentration of inducer may be any concentration consistent with the intended use of the assay to identify candidate 
compounds to which the cells are more sensitive. For example, the sub-lethal concentration of the inducer may be such 
that growth inhibition is at least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 
30%, at least about 40%, at least about 50%, at least about 60% at least about 75%, or more. Cells which are pre- 
sensitized using the preceding method are more sensitive to inhibitors of the target protein because these cells contain less 
target protein to inhibit than wild-type cells. 

In another embodiment of the cell based assays of the present invention, the level or activity of a proliferation required 
gene product is reduced using a temperature sensitive ...mutation in the proliferation-required sequence and an antisense 
nucleic acid against the proliferation-required sequence. Growing the cells at an intermediate temperature between the 
permissive and restrictive temperatures of the temperature sensitive mutant where the mutation is in a proliferation- 
required gene produces cells with reduced activity of the proliferation-required gene product. The antisense RNA directed 
against the proliferation-required sequence further reduces the activity of the proliferation required gene product. Drugs 
that may not have been found using either the temperature sensitive mutation or the antisense nucleic acid alone may be 
identified by determining whether cells in which expression of the antisense nucleic acid has been induced and which are 
grown at a temperature between the permissive temperature and the restrictive temperature are substantially more 
sensitive to a test compound than cells in which expression of the antisense nucleic acid has not been induced and which 
are grown at a permissive temperature. Also drugs found previously from either the antisense nucleic acid alone or the 
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temperature sensitive mutation alone may have a different sensitivity profile when used in cells combining the two 
approaches, and that sensitivity profile may indicate a more specific action of the drug in inhibiting one or more activities 
of the gene product. 

Temperature sensitive mutations may be located at different sites within the gene and correspond to different 
domains of the protein. For example, the dnaB gene of Escherichia coii encodes the replication fork DNA helicase. DnaB 
has several domains, including domains for oligomerization, ATP hydrolysis, DNA binding, interaction with primase, 
interaction with DnaC, and interaction with OnaA [(Biswas, E.E. and Biswas, S.B. 1999. Mechanism and DnaB helicase of 
Escherichia coir, structural domains involved in ATP hydrolysis, DNA binding, and oligomerization. Biochem. 38:10919- 
10928; Hiasa, H. and Marians, K.J. 1999. Initiation of bidirectional replication at the chromosomal origin is directed by the 
interaction between helicase and primase. J. Biol. Chem. 274:27244-27248; San Martin, C, Radermacher, M., 
Wolpensinger, B., Engei, A., Miles, C.S., Dixon, N.E., and Carazo, J.M. 1998. Three-dimensional reconstructions from 
cryoelectron microscopy images reveal an intimate complex between helicase DnaB and its loading partner DnaC. 
Structure 6:501-9; Sutton, M.D., Carr, K.M., Vicente, M., and Kaguni, J.M. 1998. Escherichia ^//DnaA protein. The N- 
terminal domain and loading of DnaB helicase at the E fl?// chromosomal. J. Biol. Chem. 273:34255-62.), the disclosures 
of which are incorporated herein by reference in their entireties]. Temperature sensitive mutations in different domains of 
DnaB confer different phenotypes at the restrictive temperature, which include either an abrupt stop or slow stop in DNA 
replication with or without DNA breakdown (Wechsler, J.A. and Gross, J.D. 1971. Escherichia coii mutants temperature- 
sensitive for DNA synthesis. Mol. Gen. Genetics 113:273-284, the disclosure of which is incorporated herein by reference 
in its entirety) and termination of growth or cell death. Combining the use of temperature sensitive mutations in the dnaB 
gene that cause cell death at the restrictive temperature with an antisense to the dnaB gene could lead to the discovery of 
very specific and effective inhibitors of one or a subset of activities exhibited by DnaB. 

When screening for antimicrobial agents against a gene product required for proliferation, growth inhibition of cells 
containing a limiting amount of that proliferation-required gene product can be assayed. Growth inhibition can be 
measured by directly comparing the amount of growth, measured by the optical density of the growth medium, between an 
experimental sample and a control sample. Alternative methods for assaying cell proliferation include measuring green 
fluorescent protein (GFP) reporter construct emissions, various enzymatic activity assays, and other methods well known 
in the art. 

It will be appreciated that the above method may be performed in solid phase, liquid phase or a combination of 
the two. For example, cells grown on nutrient agar containing the inducer of the antisense construct may be exposed to 
compounds spotted onto the agar surface. A compound's effect may be judged from the diameter of the resulting killing 
zone, the area around the compound application point in which cells do not grow. Multiple compounds may be transferred 
to agar plates and simultaneously tested using automated and semi-automated equipment including but not restricted to 
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multkhannel pipettes (for example the Beckman Multimek) and multi channel spotters (for example the Genomic Solutions 
Flexys). In this way multiple plates and thousands to millions of compounds may be tested per day. 

The compounds may also be tested entirely in liquid phase using microtiter plates as described below. Liquid 
phase screening may be performed in microtiter plates containing 96, 384, 1536 or more wells per microtiter plate to 
screen multiple plates and thousands to millions of compounds per day. Automated and semiautomated equipment may be 
used for addition of reagents (for example cells and compounds) and determination of cell density. 

EXAMPLE 9 

The effectiveness of the above cell based assay was validated using constructs expressing antisense RNA to £ 
coli genes rpIL, rplJ, and rplW encoding ribosomal proteins L7/L1 2, L10 and L23 respectively. These proteins are part of 
the protein synthesis apparatus of the cell and as such are required for proliferation. These constructs were used to test 
the effect of antisense expression on cell sensitivity to antibiotics known to bind to the ribosome and thereby inhibit 
protein synthesis. Constructs expressing antisense RNA to several other genes (elaD, visC, yohH, and aptE/B), the 
products of which are not involved in protein synthesis were used for comparison. 

First expression vectors containing antisense constructs to either rplW or to elaD were introduced into separate 
£ a?// cell populations. Vector introduction is a technique well known to those of ordinary skill in the art. The expression 
vectors of this example contain IPTG inducible promoters that drive the expression of the antisense RNA in the presence of 
the inducer. However, those skilled in the art will appreciate that other inducible promoters may also be used. Suitable 
expression vectors are also well known in the art. The£ coli antisense clones encoding ribosomal proteins L7/L12, L10 
and L23 were used to test the effect of antisense expression on cell sensitivity to the antibiotics known to bind to these 
proteins. First, expression vectors containing antisense to either the genes encoding L7/L12 and L10 or L23 were 
introduced into separate E. coli cell populations. 

The cell populations were exposed to a range of IPTG concentrations in liquid medium to obtain the growth 
inhibitory dose curve for each clone (Fig. 1). First, seed cultures were grown to a particular turbidity that is measured by 
the optical density (OD) of the growth solution. The OD of the solution is directly related to the number of bacterial cells 
contained therein. Subsequently, sixteen 200 ul liquid medium cultures were grown in a 96 well microtiter plate at 37 C 
with a range of IPTG concentrations in duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final concentration). 
Additionally, control cells were grown in duplicate without IPTG. These cultures were started from equal amounts of cells 
derived from the same initial seed culture of a clone of interest. The cells were grown for up to 15 hours and the extent of 
growth was determined by measuring the optical density of the cultures at 600 nm. When the control culture reached mid- 
log phase the percent growth of the control for each of the IPTG containing cultures was plotted against the log 
concentrations of IPTG to produce a growth inhibitory dose response curve for the IPTG. The concentration of IPTG that 
inhibits cell growth to 50% (IC^) as compared to the 0 mM IPTG control (0% growth inhibition) was then calculated from 
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the curve. Under these conditions, an amount of antisense RNA was produced that reduced the expression levels of rpIW 
and elaO to a degree such that growth was inhibited by 50%. 

Alternative methods of measuring growth are also contemplated. Examples of these methods include 
measurements of proteins, the expression of which is engineered into the cells being tested and can readily be measured. 
Examples of such proteins include green fluorescent protein (GFP) and various enzymes. 

Cells were pretreated with the selected concentration of IPTG and then used to test the sensitivity of cell 
populations to tetracycline, erythromycin and other protein synthesis inhibitors. An example of a tetracycline dose 
response curve is shown in Figures 2A and 2B for the rpIW and elaD genes, respectively. Cells were grown to log phase 
and then diluted into media alone or media containing IPTG at concentrations which give 20% and 50% growth inhibition 
as determined by IPTG dose response curves. After 2.5 hours, the cells were diluted to a final OD600 of 0.002 into 96 
well plates containing (1) +/• IPTG at the same concentrations used for the 15 hour pre-incubation; and (2) serial two-fold 
dilutions of tetracycline such that the final concentrations of tetracycline range from 1 ^ig/ml to 15.6 ng/ml and 0 |ig/mL 
The 96 well plates were incubated at 37°C and the 0D600 was read by a plate reader every 5 minutes for up to 15 hours. 
Hi each IPTG concentration and the no IPTG control, tetracycline dose response curves were determined when the control 
(absence of tetracycline) reached 0.1 0D600. To compare tetracycline sensitivity with and without IPTG, tetracycline 
IC50s were determined from the dose response curves (Figs. 2A-B). Cells with reduced levels of L23 (rpIW) showed 
increased sensitivity to tetracycline (Fig. 2A) as compared to cells with reduced levels of elaD (Fig. 2B). Figure 3 shows a 
summary bar chart in which the ratios of tetracycline IC50s determined in the presence of IPTG which gives 50% growth 
inhibition versus tetracycline IC50s determined without IPTG (fold increase in tetracycline sensitivity) were plotted. Cells 
with reduced levels of either L7/L12 (genes rpIL, rplJ) or L23 \rplW) showed increased sensitivity to tetracycline (Fig. 3). 
Cells expressing antisense to genes not known to be involved in protein synthesis [atpB/E t visC, elaD, yohH\ did not show 
the same increased sensitivity to tetracycline, validating the specificity of this assay (Fig. 3). 

In addition to the above, it has been observed in initial experiments that clones expressing antisense RNA to 
genes involved in protein synthesis (including genes encoding ribosomal proteins L7/L12 & L10, L7/L12 alone, L22, and 
L18, as well as genes encoding rRNA and Elongation Factor G) have increased sensitivity to the macrolide, erythromycin, 
whereas clones expressing antisense to the non-protein synthesis genes elaD, atpB/E and visC do not. Furthermore, the 
clone expressing antisense to rpIL and rplJ does not show increased sensitivity to nalidixic acid and ofloxacin, antibiotics 
which do not inhibit protein synthesis. 

The results with the ribosomal protein genes rpIL, rplJ, and rpIW as well as the initial results using various other 
antisense clones and antibiotics show that limiting the concentration of an antibiotic target makes cells more sensitive to 
the antimicrobial agents that specifically interact with that protein. The results also show that these cells are sensitized 
to antimicrobial agents that inhibit the overall function in which the protein target is involved but are not sensitized to 
antimicrobial agents that inhibit other functions. 
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The cell based assay described above may also be used to identify the biological pathway in which a 
proliferation-required nucleic acid or its gene product lies. In such methods, cells expressing a sub-lethal level of antisense 
to a target proliferation-required nucleic acid and control cells in which expression of the antisense has not been induced 
are contacted with a panel of antibiotics known to act in various pathways. If the antibiotic acts in the pathway in which 
the target proliferation-required nucleic acid or its gene product lies, cells in which expression of the antisense has been 
induced will be more sensitive to the antibiotic than cells in which expression of the antisense has not been induced. 

As a control, the results of the assay may be confirmed by contacting a panel of cells expressing antisense 
nucleic acids to many different proliferation-required genes including the target proliferation-required gene. If the antibiotic 
is acting specifically, heightened sensitivity to the antibiotic will be observed only in the cells expressing antisense to a 
target proliferation-required gene (or cells expressing antisense to other proliferation-required genes in the same pathway 
as the target proliferation-required gene) but will not be observed generally in all cells expressing antisense to proliferation- 
required genes. 

Similarly, the above method may be used to determine the pathway on which a test antibiotic acts. A panel of 
cells, each of which expresses antisense to a proliferation-required nucleic acid in a known pathway, is contacted with a 
compound for which it is desired to determine the pathway on which it acts. The sensitivity of the panel of cells to the 
test compound is determined in ceils in which expression of the antisense has been induced and in control cells in which 
expression of the antisense has not been induced. If the test antibiotic acts on the pathway on which an antisense nucleic 
acid acts, cells in which expression of the antisense has been induced will be more sensitive to the antibiotic than cells in 
which expression of the antisense has not been induced. In addition, control cells in which expression of antisense to 
proliferation-required genes in other pathways has been induced will not exhibit heightened sensitivity to the antibiotic. In 
this way, the pathway on which the test antibiotic acts may be determined. 

The Example below provides one method for performing such assays. 

EXAMPLE 10 

Identification of thePathwav in which a Proliferation-Required 
Gene Lies or the Pathway on whic h an Antibiotic Acts 
A. Preparation of B acterial Stocks for Assay 

To provide a consistent source of cells to screen, frozen stocks of host bacteria containing the desired antisense 
construct are prepared using standard microbiological techniques. For example, a single clone of the organism can be 
isolated by streaking out a sample of the original stock onto an agar plate containing nutrients for cell growth and an 
antibiotic for which the antisense construct contains a gene which confers resistance. After overnight growth an isolated 
colony is picked from the plate with a sterile needle and transferred to an appropriate liquid growth media containing the 
antibiotic required for maintenance of the plasmid. The cells are incubated at 30°C to 37°C with vigorous shaking for 4 to 
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6 hours to yield a culture in exponential growth. Sterile glycerol is added to 15% (volume to volume) and 100*iL to 500 
aliquots are distributed into sterile cryotubes, snap frozen in liquid nitrogen, and stored at -80°C for future assays. 

B. Growth of Bacteria for Use in the Assay 

A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed (37°C water bath) and a loop of 
culture is streaked out on an agar plate containing nutrients for cell growth and an antibiotic to which the antisense 
construct confers resistance. After overnight growth at 37°C, ten randomly chosen, isolated colonies are transferred from 
the plate (sterile inoculum loop) to a sterile tube containing 5 ml of LB medium containing the antibiotic to which the 
antisense vector confers resistance. After vigorous mixing to form a homogeneous ceil suspension, the optical density of 
the suspension is measured at BOO nm (OD600) and if necessary an aliquot of the suspension is diluted into a second tube 
of 5 mU sterile, LB medium plus antibiotic to achieve an OD600 <; 0.02 absorbance units. The culture is then incubated at 
37° C for 1-2 hrs with shaking until the 0D600 reaches OD 0.2 - 0.3. At this point the ceils are ready to be used in the 
assay. 

C. Selection of Media to be Used in Assay 

Two fold dilution series of the inducer are generated in culture media containing the appropriate antibiotic for 
maintenance of the antisense construct. Several media are tested side by side and three to four wells are used to evaluate 
the effects of the inducer at each concentration in each media. For example, M9 minimal media, LB broth, TBD broth and 
Muller Hinton media may be tested with the inducer IPTG at the following concentrations, 50 jiM, 100 jiM, 200 ^M, 400 
HM, 600 nM, 800 \iM and 1000 jiM. Equal volumes of test media-inducer and cells are added to the wells of a 384 weU 
microliter plate and mixed. The cells are prepared as described above and diluted 1:100 in the appropriate media 
containing the test antibiotic immediately prior to addition to the microtiter plate wells. For a control, cells are also added 
to several wells of each media that do not contain inducer, for example 0 M IPTG. Cell growth is monitored continuously 
by incubation at 37°C in a microtiter plate reader monitoring the 00600 of the wells over an 18-hour period. The percent 
inhibition of growth produced by each concentration of inducer is calculated by comparing the rates of logarithmic growth 
against that exhibited by cells growing in media without inducer. The medium yielding greatest sensitivity to inducer is 
selected for use in the assays described below. 

D. Measurement of Test Antibio tic Sensitivity in the Absence of Antisense Construct Induction 

Two-fold dilution series of antibiotics of known mechanism of action are generated in the culture media selected 
for further assay development that has been supplemented with the antibiotic used to maintain the construct. A panel of 
test antibiotics known to act on different pathways is tested side by side with three to four wells being used to evaluate 
the effect of a test antibiotic on cell growth at each concentration. Equal volumes of test antibiotic and cells are added to 
the wells of a 384 well microtiter plate and mixed. Cells are prepared as described above using the media selected for 
assay development supplemented with the antibiotic required to maintain the antisense construct and are diluted 1:100 in 
identical media immediately prior to addition to the microtiter plate wells. For a control, cells are also added to several 
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wells that contain the solvent used to dissolve the antibiotics but no antibiotic. Cell growth is monitored continuously by 
incubation at 37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18-hour period. The percent 
inhibition of growth produced by each concentration of antibiotic is calculated by comparing the rates of logarithmic 
growth against that exhibited by cells growing in media without antibiotic. A plot of percent inhibition against 
log[antibiotic concentration) allows extrapolation of an IC 50 value for each antibiotic. 
E - Measurement of Test Antibiotic Sensitivity in t he Presence of Antisense Construct Inducer 

The culture media selected for use in the assay is supplemented with inducer at concentrations shown to inhibit 
cell growth by 50 and 80% as described above and the antibiotic used to maintain the construct. Two fold dilution series 
of the panel of test antibiotics used above are generated in each of these media. Several antibiotics are tested side by side 
with three to four wells being used to evaluate the effects of an antibiotic on cell growth at each concentration, in each 
media. Equal volumes of test antibiotic and cells are added to the wells of a 384 well microtiter plate and mixed. Cells are 
prepared as described above using the media selected for use in the assay supplemented with the antibiotic required to 
maintain the antisense construct. The cells are diluted 1:100 into two 50 ml aliquots of identical media containing 
concentrations of inducer that have been shown to inhibit cell growth by 50% and 80 % respectively and incubated at 
37°C with shaking for 2.5 hours. Immediately prior to addition to the microtiter plate wells, the cultures are adjusted to 
an appropriate 0D 600 (typically 0.002) by dilution into warm (37°C) sterile media supplemented with identical 
concentrations of the inducer and antibiotic used to maintain the antisense construct. For a control, cells are also added 
to several wells that contain solvent used to dissolve test antibiotics but which contain no antibiotic. Cell growth is 
monitored continuously by incubation at 37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18- 
hour period. The percent inhibition of growth produced by each concentration of antibiotic is calculated by comparing the 
rates of logarithmic growth against that exhibited by cells growing in media without antibiotic. A plot of percent inhibition 
against log[antibiotic concentration] allows extrapolation of an ICso value for each antibiotic. 
F. Determining the Specificity of the Test Antibiotics 

A comparison of the IC so s generated by antibiotics of known mechanism of action under antisense induced and 
non-induced conditions allows the pathway in which a proliferation-required nucleic acid lies to be identified. If cells 
expressing an antisense nucleic acid against a proliferation-required gene are selectively sensitive to an antibiotic acting 
via a particular pathway, then the gene against which the antisense acts is involved in the pathway in which the antibiotic 
acts. 

G- Identification of Pathway in which a Test Antibiotic Acts 

As discussed above, the cell based assay may also be used to determine the pathway against which a test 
antibiotic acts. In such an analysis, the pathways against which each member of a panel of antisense nucleic acids acts 
are identified as described above. A panel of cells, each containing an inducible antisense vector against a gene in a 
known proliferation-required pathway, is contacted with a test antibiotic for which it is desired to determine the pathway 
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on which it acts under inducing an non-inducing conditions. If heightened sensitivity is observed in induced cells expressing 
antisense against a gene in a particular pathway but not in induced cells expressing antisense against genes in other 
pathways, then the test antibiotic acts against the pathway for which heightened sensitivity wes observed. 

One skilled in the art will appreciate that further optimization of the assay conditions, such as the concentration 
of inducer used to induce antisense expression and/or the growth conditions used for the assay (for example incubation 
temperature and media components) may further increase the selectivity end/or magnitude of the antibiotic sensitization 
exhibited. 

The following example confirms the effectiveness of the methods described above. 

EXAMPLE 11 

Identification of the Pathway in which a Proliferation-Required Gene Lies 
Antibiotics of various chemical classes and modes of action were purchased from Sigma Chemicals (St. Louis. 
MO). Stock solutions were prepered by dissolving each antibiotic in an appropriate aqueous solution based on information 
provided by the manufacturer. The final working solution of each antibiotic contained no more than 0.2% (wfv) of any 
organic solvent. To determine their potency against a bacterial strain engineered for expression of an antisense against a 
proliferation-required BOS ribosomal protein, each antibiotic was serially diluted two or three fold in growth medium 
supplemented with the appropriate antibiotic for maintenance of the anti-sense construct. At least ten dilutions were 
prepared for eech antibiotic. 25 nL aliquots of each dilution were transferred to discrete wells of a 384-well microplate 
(the assay plate) using a multi-channel pipette. Quadruplicate wells were used for each dilution of an antibiotic under each 
treatment condition (plus and minus inducer). Each assay plate contained twenty wells for cell growth controls (growth 
media replacing antibiotic), ten wells for each treatment (plus and minus inducer, in this example IPTG). Assey plates were 
usuelly divided into the two treatments: half the plate containing induced cells end an appropriate concentrations of 
inducer (in this example IPTG) to maintain the state of induction, the other half containing non-induced cells in the ebsence 
of IPTG. 

Cells for the assay were prepared as follows. Bacterial cells containing a construct, from which expression of 
antisense nucleic acid against rpIL and rplJ, which encode proliferation-required 50S ribosomal subunit proteins, is 
inducible in the presence of IPTG, were grown into exponential growth {0D m 0.2 to 0.3) and then diluted 1:100 into fresh 
media containing either 400 nM or 0 nM inducer (IPTG). These cultures were incubated at 37° C for 2.5 hr. After a 2.5 
hr incubation, induced and non-induced cells were respectively diluted into an assay medium at a final 0D m value of 
0.0004. The medium contained an appropriate concentration of the antibiotic for the maintenance of the anti-sense 
construct. In addition, the medium used to dilute induced cells was supplemented with 800 nM IPTG so that addition to 
the assay plate would result in e final IPTG concentration. of. 400 uNI. Induced and non-induced cell suspensions were 
dispensed (25 nl/well) into the appropriate wells of the assay plate as discussed previously. The plate was then loaded 
into a plate reader, incubated at constant temperature, and cell growth was monitored in each well by the measurement of 
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light scattering at 595 nm. Growth was monitored every 5 minutes until the cell culture attained a stationary growth 
phase. For each concentration of antibiotic, a percentage inhibition of growth was calculated at the time point 
corresponding to mid-exponential growth for the associated control wells (no antibiotic, plus or minus IPTG). For each 
antibiotic and condition (plus or minus IPTG). a plot of percent inhibition versus log of antibiotic concentration was 
generated and the IC50 determined. A comparison of the IC 5a for each antibiotic in the presence and absence of IPTG 
revealed whether induction of the amisense construct sensitized the cell to the mechanism of action exhibited by the 
antibiotic. Cells which exhibited a significant (standard statistical analysis) numerical decrease in the IC 50 value in the 
presence of inducer were considered to have an increased sensitivity to the test antibiotic. 

The results are provided in the table below, which lists the classes and names of the antibiotics used in the 
analysis, the targets of the antibiotics, the IC50 in the absence of IPTG, the IC50 in the presence of IPTG, the 
concentration units for the IC50s, the fold increase in IC50 in the presence of IPTG, and whether increased sensitivity was 
observed in the presence of IPTG. 
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The above results demonstrate that induction of an antisense RNA to genes encoding 50S ribosomal subunit 
proteins results in a selective and highly significant sensitization of cells to antibiotics that inhibit ribosomal function and 
protein synthesis. The above results further demonstrate that induction of an antisense construct to an essential gene 
sensitizes an organism to compounds that interfere with that gene products' biological role. This sensitization is restricted 
to compounds that interfere with pathways associated with the targeted gene and it's product. 

Assays utilizing antisense constructs to essential genes can be used to identify compounds that specifically 
interfere with the activity of multiple targets in a pathway. Such constructs can be used to simultaneously screen a 
sample against multiple targets in one pathway in one reaction (Combinatorial HTS). 

Furthermore, as discussed above, panels of antisense construct containing cells may be used to characterize the 
point of intervention of any compound affecting an essential biological pathway including antibiotics with no known 
mechanism of action. 

Another embodiment of the present invention is a method for determining the pathway against which a test 
antibiotic compound is active in which the activity of target proteins or nucleic acids involved in proliferation-required 
pathways is reduced by contacting cells with a sublethal concentration of a known antibiotic which acts against the 
target protein or nucleic acid. In one embodiment, the target protein or nucleic acid is a target protein or nucleic acid 
corresponding to a proliferation-required nucleic acid identified using the methods described above. The method is similar 
to those described above for determining which pathway a test antibiotic acts against except that rather than reducing the 
activity or level of a proliferation-required gene product using a sublethal level of antisense to a proliferation-required 
nucleic acid, the activity or level of the proliferation-required gene product is reduced using sublethal level of a known 
antibiotic which acts against the proliferation required gene product. 

Interactions between drugs which affect the same biological pathway has been described in the literature. For 
example, Mecillinam (Amdinocillin) binds to and inactivates the penicillin binding protein 2 (PBP2. product of the mrdA in £ 
coh). This antibiotic inteacts with other antibiotics that inhibit PBP2 as well as antibiotics that inhibit other penicillin 
binding proteins such as PBP3 [(Gutmann. L, Vincent, S„ Billot-Klein, D., Acar, J.F., Mrena, E., and Williamson, R. (1986) 
Involvement of penicillin-binding protein 2 with other penicillin-binding proteins in lysis of Escherichia «?//by some beta- 
lactam antibiotics alone and in synergistic lytic effect of amdinocillin (mecillinam). Antimicrobial Agents & Chemotherapy, 
30:906-912), the disclosure of which is incorporated herein by reference in its entirety]. Interactions between drugs 
could, therefore, involve two drugs that inhibit the same target protein or nucleic acid or inhibit different proteins or 
nucleic acids in the same pathway [(Fukuoka, T., Domon, H., Kakuta, M., Ishii, C, Hirasawa, A., Utsui, Y., Ohya, S., and 
Yasuda, H. (1997) Combination effect between panipenem and vancomycin on highly methicillin-resistant Staphylococcus 
aureus. Japan. J. Antibio. 50:411-419; Smith, C.E., Foleno, B.E., Barrett, J.F., and Frosc, M.B. (1997) Assessment of the 
synergistic interactions of levofloxacin and ampicillin against Enterococcus faecium by the checkerboard agar dilution and 
time-kill methods. Oiagnos. Microbiol. Infect. Disease 27:85-92; den Hollander, J.G., Horrevorts, A.M., van Goor, M.L, 
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Verbrugh. H.A., and Mooton. J.W. (1997) Synergism between tobramycin and ceftazidime against a resistant 
Pseudomonas aeruginosa strain, tested in an in vitro pharmacokinetic model. Antimicrobial Agents & Chemotherapy. 
41:95-1 10), the disclosure of all of which are incorporated herein by reference in their entireties!. 

Two drugs may interact even though they inhibit different targets. For example, the proton pump inhibitor. 
Omeprazole, and the antibiotic. Amoxycillin, two synergistic compounds acting together, can cure Helicobacter pylori 
infection [{ Gabryelewicz, A., Laszewicz, W., Dzieniszewski, J., Ciok, J., Marlicz. K„ Bielecki, D., Popiela, T., Legutko, J., 
Knapik. Z.. Poniewierka, E. (1997) Multicenter evaluation of dual-therapy (omeprazol and amoxycillin) for Helicobacter 

/^associated duodenal and gastric ulcer (two years of the observation). J. PhysioL Pharmacol. 48 Suppl 4:93-105), the 

disclosure of which is incorporated herein by reference in its entirety]. 

The growth inhibition from the sublethal concentration of the known antibiotic may be at least about 5%, at 

least about 8%, at least about 10%, at least about 20%. at least about 30%, at least about 40%, at least about 50%, at 

least about 60%, or at least about 75%, or more. 

Alternatively, the sublethal concentration of the known antibiotic may be determined by measuring the activity of 

the target proliferation-required gene product rather than by measuring growth inhibition. 

Cells are contacted with a combination of each member of a panel of known antibiotics at a sublethal level and 

varying concentrations of the test antibiotic. As a control, the cells are contacted with varying concentrations of the test 

antibiotic alone. The IC S0 of the test antibiotic in the presence and absence of the known antibiotic is determined, tf the 

IC50s in the presence and absence of the known drug are substantially similar, then the test drug end the known drug act 

on different pathways. If the IC M s are substantially different, then the test drug and the known drug act on the same 

pathway. 

Another embodiment of the present invention is a method for identifying a candidate compound for use as an 
antibiotic in which the activity of target proteins or nucleic acids involved in proliferation-required pathways is reduced by 
contacting cells with a sublethal concentration of a known antibiotic which acts against the terget protein or nucleic acid. 
In one embodiment, the target protein or nucleic acid is a target protein or nucleic acid corresponding to a proliferation- 
required nucleic acid identified using the methods described above. The method is similar to those described above for 
identifying candidate compounds for use as antibiotics except that rather than reducing the activity or level of a 
proliferation-required gene product using a sublethal level of antisense to a proliferation-required nucleic acid, the activity 
or level of the proliferation-required gene product is reduced using a sublethal level of a known antibiotic which acts 
against the proliferation required gene product. 

The growth inhibition from the sublethal concentration of the known antibiotic may be at least about 5%, at 
least about 8%. at least about 10%, at least about 20%, at least about 30%, at least about 40%. at least about 50%, at 
least about 60%, or at least about 75%, or more. 



•61- 



WO 00/44906 



PCT/US00/02200 



10 



Alternatively, the sublethal concentration of the known antibiotic may be determined by measuring the activity of 
the target proliferation-required gene product rather than by measuring growth inhibition. 

In order to characterize test compounds of interest, cells are contacted with a panel of known antibiotics at a 
sublethal level and one or more concentrations of the test compound. As a control, the ceils are contacted with the same 
concentrations of the test compound alone. The IC 50 of the test compound in the presence and absence of the known 
antibiotic is determined. If the IC 50 of the test compound is substantially different in the presence and absence of the 
known drug then the test compound is a good candidate for use as an antibiotic. As discussed above, once a candidate 
compound is identified using the above methods its structure may be optimized using standard techniques such as 
combinatorial chemistry. 

Representative known antibiotics which may be used in each of the above methods are provided in the table 
below. However, it will be appreciated that other antibiotics may also be used. 



ANTIBIOTIC 



INHIBITS/TARGET 



RESISTANT MUTANTS 



Inhibitors of Transcription 

Rifamycin, 1959 Rifampicin 

Rifabutin Rifaximin 
Streptolysin 

Streptovaricin 
ActinomycinD+EDTA 



Inhibits initiation of transcription/6-subunit RNA 
polymerase, rpoB 

Accelerates transcription chain termination/^ 
subunit RNA polymerase 
an acyclic ansamycin, inhibits RNA polymerase 
Intercalates between 2 successive G-C pairs, 
rpoB, inhibits RNA synthesis 



Inhibitors of Nucleic Acid Metabolism 

Quinolones, 1962 Nalidixic acid subunit gyrase and/or topoisomerase IV, gyrA 
Oxolinic acid 

Fluoroquinolones Ciprofloxacin, subunit gyrase, gyrA and/or topoisomerase IV 
1983 Norfloxacin (probable target in Staph) 

Coumerins Novobiocin Inhibits ATPase activity of B-subunit gyrase, 

gyrB 

Coumermycin Inhibits ATPase activity of B-subunit gyrase, 

gyrB 

Albicidin DNA synthesis 

Metronidazole Causes single-strand breaks in DNA 

Inhibitors of Metabolic Pathways 

Sulfonamides, 1932 Sulfanilamide blocks synthesis of dihydrofolate,dihydro- 

pteroate synthesis, folP 
Trimethoprim, 1 962 Inhibits dihydrof olate reductase, folA 

Showdomycin Nucleoside analogue capable of alkylating 



rpoB, crp, cyaA 

rpoB 

rpoB 
pldA 



gyrAorB, icd, sloB 
gyrA 

norA (efflux in Staph) 



gyrB, cy$B, cysE, nov, 

ompA 
gyrB f hisW 

tar (nucleoside channel) 
nar 



folP, gpt, pabA, pabB, 
pabC 
folA, thyA 
nupC, pnp 
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ANTIBIOTIC 



INHIBITS/TARGET 



RESISTANT MUTANTS 



Thiolactomycin 



Psicofuranine 
Triclosan 

Diazoborines Isoniazid, 
Ethionamide 



sulfhydryl groups, inhibitor of thymidylate 
synthetase 

type II fatty acid synthase inhibitor emrB 

fadB, emrB due to gene 
dosage 

Adenosine glycoside antibiotic, target is GMP guaA,B 
synthetase 

Inhibits fatty acid synthesis fabf(envM) 
heterocyclic, contains boron, inhibit fatty acid fabl(envM) 
synthesis, enoyl-ACP reductase, fabl 



Inhibitors of Translation 

Phenylpropanoids 

Chloramphenicol, 1947 



Tetracyclines, 1948, type II 

poiyketides 

Minocycline 

Doxycycline 
Macrolides (type I poiyketides) 

Erythromycin, 1950 
Carbomycin, Spiramycin 

etc 

Aminoglycosides Streptomycin, 
1944 
Neomycin 

Spectinomycin Kanamycin 

Kasugamycin 

Gentamicin, 1963 

Amikacin 

Paromycin 
Lincosamides 

Lincomycin, 1955 Clindamycin 
Streptogramins Virginiamycin, 

1955 Pristinamycin 
Synercid: quinupristin 

/dalfopristin 
Fusidanes 

Fusidic Acid 
Kirromycin (Mocimycin) 



Binds to ribosomal peptidyl transfer center 
preventing peptide translocation/ binds to S6, 
L3, L6,L14,L16, L25, L26, L27, but 
preferentially to L16 

Binding to 30S ribosomal subunit, "A" site 
on 30S subunit, blocks peptide elongation, 
strongest binding to S7 

Binding to 50 S ribosomal subunit 23S rRNA, 
blocks peptide translocation, LI 5, L4, LI 2 



Irreversible binding to 30S ribosomal subunit, 
prevents translation or causes mistranslation of 
mRNA/16S rRNA 



rrn, cm/A, mar A ompF, 
ompR 

clmA (cmrl mar, ompF 



Binding to 50 S ribosomal subunit, blocks 
peptide translocation 

2 components, Streptogramins A&B, bind to 
the SOS ribosomal subunit blocking peptide 
translocation and peptide bond formation 



Inhibition of elongation factor G (EF-G) prevents fusA 

peptide translocation 

Inhibition of elongation factor TU (EFTu), tufA,B 
prevents peptide bond formation 



rm,rplC t rplD,rplV t mac 



rpsL, strCM ubiF 
atpA-E, ecfB, 

hemAC,D,E,G, topA, 
rpsC f D,E, rm, spcB 
atpA-atpE, cpxA, ecfB, 

hemA,B,L, topA 
ksgA,B,C,D,rplB,K, 

rpsl,N,M,R 
rplF, ubiF 
cpxA 
rpsL 

HnB, rplNA rp$G 
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ANTIBIOTIC 



INHIBITS/TARGET 



RESISTANT MUTANTS 



Pulvomycin 
Thiopeptin 

Tiarnulin 
Negamycin 

Oxazolidinones Linezolid 
Isoniazid 

Nitrofurantoin 



Pseudomonic Acids Mupirocin 

(Bactroban) 
Indolmycin 
Viomycin 



Thiopeptides 

Thiostrepton 

Micrococcin 



Binds to and inhibits EF-TU 
Sulfur-containing antibiotic, inhibits protein 

synthesis,EF-G 
Inhibits protein synthesis 
Inhibits termination process of protein 

synthesis 
23S rRNA 



Inhibits protein synthesis, nitroreductases 
convert nitrofurantoin to highly reactive 
eiectrophilic intermediates which attack 
bacterial ribosomai proteins non-specifically 

Inhibition of isoleucyl tRNA synthetase-used for 
Staph, topical cream, nasal spray 

Inhibits tryptophanyltRNA synthetase 



Binds to L1 1-23S RNA complex 
Inhibits GTP hydrolysis by EF-G 
Stimulates GTP hydrolysis by EF-G 



rplE 

rpIC, rpID 
prfB 



pttx 
nfnA,B 



HeS 
trpS 

rrmA (23$ rRNA 
methyltransferase; 
mutant has slow 
growth rate, slow chain 
elongation rate, and 
viomycin resistance) 



Inhibitors of Cell Walls/Membranes 
(Mactams 

Penicillin, 1929 Ampicillin 

Methiciilin, 1960 



Cephalosporins, 1962 
Meciliinam famdinocillin) 

Aztreonam (Furazloctliin) 
Bacilysin, Tetaine 
Glycopeptides Vancomycin, 1955 

Polypeptides Bacitracin 

Cyclic lipopeptide Daptomycin, 1980 



Inhibition of one or more cell wall 
transpeptidases, endopeptidases, and 
glycosidases (PBPs), of the 12 PBPs only 2 
are essential: mrdA (PBP2) and ftsKpbpB, 
PBP3) 



Binds to and inactivates PBP2 {mrdA) 
Inactivates PBP3 {ft$I\ 



ampC, ampD, ampE, 
emZ, galU, hip A, 
hipd ompC, ompF, 
ompR, ptst, rfa, tolD, 
to/E 

tonB 

alaS, argS, crp, cyaA, 
envB, mrdA,B, 
mreB,C,D 



Dipeptide, inhib glucosamine synthase 
Inhib G+ cell wall syn, binds to terminal D- 
ala-D-ala of pentapeptide, 
Prevents dephosphorylation and regeneration 
of lipid carrier rfa 
Disrupts multiple aspects of membrane 
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function, including peptidoglycan synthesis, 
lipoteichoic acid synthesis, and the bacterial 
membrane potential 

Cyclic polypeptides Polymixin, 1 939 Surfactant action disrupts cell membrane pmrA 

lipids, binds lipid A mioety of LPS 
Fosfomycin, 19B9 Analogue of P-enolpyruvate. inhibits 1" step murA. crp, cyaA glpT, 

in peptidoglycan synthesis • UDP-N- " hipA, ptsl. uhpT ' 

acetylglucosamine enolpyruvyl transferase, 

murA. Also acts as Immunosuppressant 
Cycloserine Prevents formation of O-ala dimer, inhibits D- hipA,cycA 

ala ligase, ddlA,B 

Ala,osfalin phosphonodipeptide, cell wall synthesis pepA, tpp 

inhibitor, potentiator of -lactams 

Inhibitors of Protein Processing/Transport 

G,obom Y cin Inhibits signal peptidase II {cleaves l PP ,dnaE 

prolipoproteins subsequent to lipid 
modification, IspA 

EXAMPLE 12 

Transfer of Exogenous Nucleic Acid Snqwnrw to other Bacterial Sneries Usinn thp E mnt™ ^ y firtnr!i nr F¥f frnw - nw 

Vectors Functional in Rar terial Soecies other than E coll. 
The above methods were validated using antisense nucleic acids which inhibit the growth of £ coll which were 
identified using methods similar to those described above. Expression vectors which inhibited growth of E. coli upon 
induction of antisense RNA expression with IPTG were transformed directly into Enterobacter cloacae, Klebsiella 
pneumonia or Salmonella typhimurium. The transformed cells were then assayed for growth inhibition according to the 
method of Example 1. After growth in liquid culture, cells were plated at various serial dilutions and a score determined by 
calculating the log difference in growth for INDUCED vs. UNINDUCED antisense RNA expression as determined by the 
maximum 10 fold dilution at which a colony was observed. The results of these experiments are listed below in Table VI. 
If there was no effect of antisense RNA expression in an organism, the clone is minus in Table VI. In contrast, a positive in 
Table VI means that at least 10 fold more cells were required to observe a colony on the induced plate than on the non- 
induced plate under the conditions used and in that organism. 

Sixteen of the construts were found to inhibit growth in all the organisms tested upon induction of antisense 
RNA expression with IPTG. Those skilled in the art will appreciate that a negative result in a heterologous organism does 
not mean that that organism is missing that gene nor does it mean that the gene is unessential. However, a positive result 
means that the heterologous organism contains a homologous gene which is required for proliferation of that organism. 
The homologous gene may be obtained using the methods described herein. Those cells that are inhibited by antisense 
may be used in cell based assays as described herein for the identification and characterization of compounds in order to 



-65- 



WO 00/44906 



PCT7US00/02200 



develop antibiotics effective in these organisms. Those skilled in the art will appreciate that an antisense molecule which 
works in the organism from which it was obtained will not always work in a heterologous organism. 



TABLE VI 

Sensitivity of Other Microorganisms to Antisens e Nucleic Acids That Inhibit Proliferation in E eofi 



Mol. No. 


S. typhimurium 


£ cloacae 


K. pneumoniae 


EcXAOOl 


+ 


+ 


• 


EcXA004 


• 


• 


• 


EcXA005 


+ 


+ 


+ 


EcXA006 


• 


• 


• 


EcXA007 


• 


+ 


- 


EcXA008 


+ 


• 


+ 


EcXAOlO 


+ 


+ 


+ 


EcXA011 


! 


+ 


• 


EcXA012 


! 


+ 


• 


EcXA013 


+ 


+ 


+ 


EcXAOH 


+ 


+ 


- 


EcXA015 


• 


+ 


+ 


EcXAOIB 


+ 


+ 


+ 


EcXA017 


+ 




+ 


EcXA018 


+ 


+ 


+ 


EcXA019 


+ 


+ 


+ 


CCAMU^U 


+ 


+ 


+ 


EcXA021 


+ 


+ 


+ 


EcXA023 


+ 


+ 


+ 


EcXA024 


+ 




+ 


EcXA025 








EcXA026 


+ 


+ 




EcXA027 


+ 


+ 


+ 


EcXA028 








EcXA029 
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Mot. No. 


S. typhimurium 


£ cloacae 


K. pneumoniae 


EcXA030 


+ 


+ 


+ 


EcXA031 


+ 






EcXA032 


I + 






EcXA033 


+ 


+ 


+ 


EcXA034 


+ 


+ 


+ 


EcXA035 








EcXA036 


+ 




+ 


EcXA037 




+ 




EcXA038 


+ 


+ 




EcXA039 


+ 






EcXA041 


+ 




+ 


EcXA042 




+ 


+ 


EcXAQ44 


• 


• 




EcXA045 


- 


+ 




; EcXA046 


- 






EcXA047 


+ 


+ 




EcXA048 


- 






EcXA049 


+ 






EcXA050 








EcXA051 


+ 


• 




EcXA052 


+ 






EcXA053 


+ 




+ 


ECXA054 






+ 


EcXA055 


+ 







EXAMPLE 13 

Use of Identified Exogenous Nucleic Acid Sentiences as Probes 
The identified sequence of the present invention can be used as probes to obtain the sequence of additional 
genes of interest from a second organism. For example, probes to potential bacterial target proteins may be hybridized to 
nucleic acids from other organisms including other bacteria and higher organisms, to identify homologous sequences. Such 
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hybridization might indicate that the protein encoded by the gene to which the probe corresponds is found in humans and 
therefore not necessarily a good drug target. Alternatively, the gene can be conserved only in bacteria and therefore 
would be a good drug target for a broad spectrum antibiotic or antimicrobial. 

Probes derived from the identified nucleic acid sequences of interest or portions thereof can be labeled with 
detectable labels familiar to those skilled in the art, including radioisotopes and non-radioactive labels, to provide a detectable 
probe. The detectable probe can be single stranded or double stranded and can be made using techniques known in the art, 
including in vitro transcription, nick translation, or kinase reactions. A nucleic acid sample containing a sequence capable of 
hybridizing to the labeled probe is contacted with the labeled probe. If the nucleic acid in the sample is double stranded, it can 
be denatured prior to contacting the probe. In some applications, the nucleic acid sample can be immobiGzed on a surface such 
as a nitrocellulose or nylon membrane. The nucleic acid sample can comprise nucleic acids obtained from a variety of sources, 
including genomic DNA, cDNA libraries, RNA, or tissue samples. 

Procedures used to detect the presence of nucleic acids capable of hybridizing to the detectable probe include well 
known techniques such as Southern blotting, Northern blotting, dot blotting, colony hybridization, and plaque hybridization. In 
some applications, the nucleic acid capable of hybridizing to the labeled probe can be cloned into vectors such as expression 
vectors, sequencing vectors, or in vitro transcription vectors to facilitate the characterization and expression of the hybridizing 
nucleic acids in the sample. For example, such techniques can be used to isolate, purify and clone sequences from a genomic 
library, made from a variety of bacterial species, which are capable of hybridizing to probes made from the sequences identified 
in Examples 5 and 6. 

EXAMPLE 14 

Preparation of PCR Primers and Amplification of DNA 
The identified E. coli genes corresponding directly to or located within the operon of nucleic acid sequences required 
for proliferation or portions thereof can be used to prepare PCR primers for a variety of applications, including the identification 
or isolation of homologous sequences from other species, for example S. typhimurium, £ cloacae, and Klebsiella pneumoniae, 
which contain part or all of the homologous genes. Because homologous genes are related but not identical in sequence, those 
skilled in the art will often employ degenerate sequence PCR primers. Such degenerate sequence praners are designed based on 
conserved sequence regions, either known or suspected, such as conserved coding regions. The successful production of a PCR 
product using degenerate probes generated from the sequences identified herein would indicate the presence of a homologous 
gene sequence in the species being screened. The PCR primers are at least 10 bases, and preferably at least 20 bases in 
length. More preferably, the PCR primers are at least 20-30 bases in length. In some embodiments, the PCR primers can be 
more than 30 bases in length. It is preferred that the primer pairs have approximately the same G/C ratio, so that melting 
temperatures are approximately the same. A variety of PCR techniques are familiar to those skilled in the art. For a review of 
PCR technology, see Molecular Cloning to Genetic Engineering White, B.A. Ed. in Methods in Molecular Biology 67: Humana 
Press, Totowa 1997. When the entire coding sequence of the target gene is known, the 5' and 3' regions of the target gene 



WO 00/44906 



PCT/US00/02200 



can be used as the sequence source for PCR probe generation. In each of these PCR procedures. PCR primers on either side of 
the nucleic acid sequences to be ampOfied are added to a suitably prepared nucleic acid sample along with dNTPs and a 
thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase. The nucleic acid in the sample is 
denatured and the PCR primers are specifically hybridized to complementary nucleic acid sequences in the sample. The 
hybridized primers are extended. Thereafter, another cycle of denaturation, hybridation, and extension is initiated. The cycles 
are repeated multiple times to produce an amplified fragment containing the nucleic acid sequence between the primer sites. 

EXAMPLE 15 
Inverse PCR 

The technique of inverse polymerase chain reaction can be used to extend the known nucleic acid sequence identified 
in Examples 5 and 6. The inverse PCR reaction is described generally by Ochman et aL, in Ch. 10 of PCR Technology: 
Principles and Applications for DNA Amplification, (Henry A. Eriich, Ed.) W.H. Freeman and Co. (1992). Traditional PCR 
requires two primers that are used to prime the synthesis of complementary strands of DNA. In inverse PCR, only a core 
sequence need be known. 

Using the sequences identified as relevant from the techniques taught in Examples 5 and 6 and applied to other 
species of bacteria, a subset of exogenous nucleic sequences are identified that correspond to genes or operons that are 
required for bacterial proliferation. In species for which a genome sequence is not known, the technique of inverse PCR 
provides a method for obtaining the gene in order to determine the sequence or to place the probe sequences in full context to 
the target sequence to which the identified exogenous nucleic acid sequence binds. 

To practice this technique, the genome of the target organism is digested with an appropriate restriction enzyme so 
as to create fragments of nucleic acid that contain the identified sequence as well as unknown sequences that flank the 
identified sequence. These fragments are then circularized and become the template for the PCR reaction. PCR primers are 
designed in accordance with the teachings of Example 15 and directed to the ends of the identified sequence are synthesized. 
The primers direct nucleic acid synthesis away from the known sequence and toward the unknown sequence contained within 
the circularized template. After the PCR reaction is complete, the resulting PCR products can be sequenced so as to extend the 
sequence of the identified gene past the core sequence of the identified exogenous nucleic acid sequence identified, in this 
manner, the full sequence of each novel gene can be identified. Additionally the sequences of adjacent coding and noncoding 
regions can be identified. 

EXAMPLE 16 

Identification of Genes Required for Staphylococcus aureus Proliferation 
Genes required for proliferation in Staphylococcus aureus are identified according to the methods described above. 

EXAMPLE 17 

Identification of Genes Required ior Neisseria gonorrhoeae Proliferation 
Genes required for proliferation in Neisseria gonorrhoeae are identified according to the methods described above. 
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EXAMPLE 18 

Identification of Genes Required for P seudomonas aeruginosa Proiiferatinn 
Genes required for proliferation in Pseudomonas aeruginosa are identified according to the methods described 

above. 

5 EXAMPLE 19 

Identification of Genes Required for Enterococcus faecalis Proliferation 
Genes required for proliferation in Enterococcus faecalis are identified according to the methods described above. 

EXAMPLE 20 

Identification of Genes Required for Haemophilus influenzae Proliferation 
10 Genes required ^r proliferation in Haemophilus influenzae are identified according to the methods described above. 

EXAMPLE 21 

identification of Genes Required for Salmonella typhimurium Proliferation 
Genes required for proliferation in Salmonella typhimurium are identified according to the methods described above. 

EXAMPLE 22 

1 5 Identification of Genes Required for Helicobacter pylori Proliferation 

Genes required for proliferation in Helicobacter pylori are identified according to the methods described above. 

EXAMPLE 23 

Identification of Genes Required for Mycoplasma pneumnniae Proiiferatinn 
Genes required for proliferation in Mycoplasma pneumoniae are identified according to the methods described 

20 above. 

EXAMPLE 24 

Identification of Genes Required for Plasmodium ovale Proliferation 
Genes required for proliferation in Plasmodium oyale are identified according to the methods described above. 

EXAMPLE 25 

25 Identification of Genes Required for Saccharomyces cereyisiae Proiiferatinn 

Genes required for proliferation in Saccharomyces cereyisiae are identified according to the methods described 

above. 

EXAMPLE 26 

Identification of Genes Required for Entamoeba histolytica Proiiferatinn 
30 Gen8S rei P r ed for proliferation in Entamoeba histolytica are identified according to the methods described above. 

EXAMPLE 27 

Identification of Genes Reouired for Candida albicans Proliferation 
Genes required for proliferation in Candida albicans are identified according to the methods described above. 
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EXAMPLE 28 

Identification of Genes Required for Kleb siella pneumoniae Proliferation 
Genes required for proliferation in Klebsiella pneumoniae are identified according to the methods described above. 

EXAMPLE 29 

5 Identification of Genes Required for Salmonella typhi Proliferation 

Genes required for proliferation in Salmonella typhis* identified according to the methods described above. 

EXAMPLE 30 

Identification of Genes Required for Salmonella paratyphi Proliferation 
Genes required for proliferation in Salmonella paratyphi are identified according to the methods described above. 
10 EXAMPLE 31 

Identification of Genes Required for Salmonella cholerasuis Proliferation 
Genes required for proliferation in Salmonella cholerasuis are identified according to the methods described above. 

EXAMPLE 32 

Identification of Genes Required for Staphylococcus epidermis Proliferation 
15 Genes required for proliferation in Staphylococcus epidermis are identified according to the methods described 

above. 

EXAMPLE 33 

Identification of Genes Required for Mycobacterium tu berculosis Proliferation 
Genes required for proliferation in Mycobacterium tuberculosis are identified according to the methods described 

20 above. 

EXAMPLE 34 

Identification of Genes Required for Mycobacterium leprae Proliferation 
Genes required for proliferation in Mycobacterium leprae are identified according to the methods described above. 

EXAMPLE 35 

25 Identification of Genes Required for Treponema pa llidum Proliferation 

Genes required for proliferation in Treponema pallidum are identified according to the methods described above. 

EXAMPLE 36 

Identification of Genes Required for Bacillus anthracis Proliferation 
Genes required for proliferation in Bacillus anthracis are identified according to the methods described above. 
30 EXAMPLE 37 

Identification of Genes Required for Yersinia oestis Proliferation 
Genes required for proliferation in Yersinia pestis are identified according to the methods described above. 
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EXAMPLE 38 

Identification of Genes Require d for Clostridium botulinum Proliferation 
Genes required for proliferation in Clostridium botulinum are identified according to the methods described above. 

EXAMPLE 39 

Identification of Genes Re quired for Campylobacter jejuni Proliferation 
Genes required for proliferation in Campylobacter jejuni are identified according to the methods described above. 

EXAMPLE 40 

Identification of Genes Required for Chlamydia trachomatis Proliferation 

Genes required for proliferation in Chlamydia trachomatis are identified according to the methods described above. 
Use of Isolated Exogenous Nucleic Acid Fragments as Antisense Antibiotics 

In addition to using the identified sequences to enable screening of molecule libraries to identify compounds useful to 
identify antibiotics, the sequences themselves can be used as therapeutic agents. Specifically, the identified exogenous 
sequences in an antisense orientation can be provided to an individual to inhibit the translation of a bacterial target gene. 
Generation of An tisense Therapeutics from Identified Exogenous Sequences 

The sequences of the present invention can be used as antisense therapeutics for the treatment of bacterial 
infections or simply for inhibition of bacterial growth in yitro or in viyo. The therapy exploits the biological process in cells 
where genes are transcribed into messenger RNA (mRNA) that is then translated into proteins. Antisense RNA technology 
contemplates the use of antisense oligonucleotides directed against a target gene that will bind to its target and decrease 
or inhibit the translation of the target mRNA. In one embodiment, antisense oligonucleotides can be used to treat and 
control a bacterial infection of a cell culture containing a population of desired cells contaminated with bacteria. In 
another embodiment, the antisense oligonucleotides can be used to treat an organism with a bacterial infection. 

Antisense oligonucleotides can be synthesized from any of the sequences of the present invention using methods 
well known in the art. in a preferred embodiment, antisense oligonucleotides are synthesized using artificial means. 
Uhimann & Peymann, Chemical Rev. 90:543-584 (1990) review antisense oligonucleotide technology in detail. Modified or 
unmodified antisense oligonucleotides can be used as therapeutic agents. Modified antisense oligonucleotides are 
preferred since it is well known that antisense oligonucleotides are extremely unstable. Modification of the phosphate 
backbones of the antisense oligonucleotides can be achieved by substituting the internucleotide phosphate residues with 
methylphosphonates, phosphorothioates, phosphoramidates, and phosphate esters. Nonphosphate internucleotide analogs 
such as siloxane bridges, carbonate brides, thioester bridges, as well as many others known in the art. The preparation of 
certain antisense oligonucleotides with modified internucleotide linkages is described in U.S. Patent No. 5,142,047, hereby 
incorporated by reference. 

Modifications to the nucleoside units of the antisense oligonucleotides are also contemplated. These 
modifications can increase the half-life and increase cellular rates of uptake for the oligonucleotides in yiyo. For example. 
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a-anomeric nucleotide units and modified bases such as 1.2-dideoxy-d-ribofuranose. 1.2-dideoxy-l-phenylribofuranose, and 
ft , ^-ethano-Smethyl-eytosine are contemplated for use in the present invention. 

An additional form of modified antisense molecules is found in peptide nucleic acids. Peptide nucleic acids (PNA) 
have been developed to hybridize to single and double stranded nucleic acids. PNA are nucleic acid analogs in which the entire 
deoxyribose-phosphate backbone has been exchanged with a chemically completely different, but structurally homologous, 
polyamide (peptide) backbone containing 2-aminoethyl glycine units. Unlike DNA, which is highly negatively charged, the PNA 
backbone is neutral Therefore, there is much less repulsive energy between complementary strands in a PNA-ONA hybrid than 
in the comparable DNA-ONA hybrid, and consequently they are much more stable. PNA can hybridize to ONA in either a 
Watson/Crick or Hoogsteen fashion (Demidov et aL. Proc. Natl. Acad. Sci. USA 92:2637-2641, 1995; Egholm, Nature 
365:566-568. 1993; Nielsen et aL, Science 254:1497-1500, 1991; Dueholm et z\.,NewJ. Chem. 21:19-31, 1997). 

Molecules called PNA "clamps" have been synthesized which have two identical PNA sequences joined by a flexible 
hairpin linker containing three 8amino-3,6dioxaoctanoic acid units. When a PNA clamp is mixed with a complementary 
homopurine or homopyrimidine DNA target sequence, a PNA DNA-PNA triplex hybrid can form which has been shown to be 
extremely stable IBentin et aL, Biochemistry 35:8863-8869, 1996; Egholm et al. Nucleic Acids Res. 23:217-222. 1995; 
Griffith et al., J. Am. Chem. Soc. 117:831-832, 1995). 

The sequence-specific and high affinity duplex and triplex binding of PNA have been extensively described (Nielsen et 
al., Science 254:1497-1500, 1991; Egholm et al., J. Am. Chem. Soc. 1 14:9677-9678, 1992; Egholm et aL, Nature 365:566- 
568, 1993; Almarsson et aL, Proc. Natl. Acad. Sci. USA 90:9542-9546, 1993; Demidov et al., Proc Nat/. Acad. Sci. USA 
92:2637-2641, 1995). They have also been shown to be resistant to nuclease and protease digestion (Demidov et al, 
Biochem. Pharm. 48:1010-1313, 1994). PNA has been used to inhibit gene expression (Hanvey et aL, Science 258:1481- 
1485,1992; Nielsen et al.. Nucl. Acids. Res., 21:197-200, 1993; Nielsen et aL. Gene 149:139-145, 1994; Good & Nielsen, 
Science, 95: 2073-2076, 1998; all of which are hereby incorporated by reference), to block restriction enzyme activity 
(Nielsen et al., supra., 1993), to act as an artificial transcription promoter (Mollegaard, Proc. Natl. Acad. Sci. USA 91:3892- 
3895, 1994) and as a pseudo restriction endonuclease (Demidov et al.. Nucl. Acids. Res. 21:2103-2107, 1993). Recently, PNA 
has also been shown to have antiviral and antitumoral activity mediated through an antisense mechanism (Nortoa Nature 
Biotechnol.. 14:615-619, 1996; Hirschman et al., J. Investig. Med. 44:347-351, 1996). PNAs have been linked to various 
peptides in order to promote PNA entry into cells (Basu et al., Bioconj. Chem. 8:481-488, 1997; Partridge et al., Proc. Natl. 
Acad. Sci. USA 92:5592-5596, 1995). 

The antisense oligonucleotides contemplated by the present invention can be administered by direct application 
of oligonucleotides to a target using standard techniques well known in the art. The antisense oligonucleotides can be 
generated within the target using a plasmid. or a phage. Alternatively, the antisense nucleic acid may be expressed from a 
sequence in the chromosome of the target cell. It is further contemplated that contemplated that the antisense 
oligonucleotide contemplated are incorporated in a ribozyme sequence to enable the antisense to specifically bind and cleave its 
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target mRNA. For technical applications of ribozyme and antisense oligonucleotides see Rossi et al. f Pharmacol. Ther. 
50(2h245-254, (1991), which is hereby incorporated by reference. The present invention also contemplates using a retron to 
introduce an antisense oligonucleotide to a cell. Retron technology is exemplified by U.S. Patent No. 5,405,775, which is 
hereby incorporated by reference. Antisense oligonucleotides can also be delivered using liposomes or by electroporation 
techniques which are well known in the art. 

The antisense nucleic acids of the present invention can also be used to design antibiotic compounds comprising 
nucleic acids which function by intracellular triple helix formation. Triple helix oligonucleotides are used to inhibit transcription 
from a genome. The sequences identified as required for proliferation in the present invention, or portions thereof, can be used 
as templates to inhibit microorganism gene expression in individuals infected with such organisms. Traditionally, homopurine 
sequences were considered the most useful for triple helix strategies. However, homopyrimidine sequences can also inhibit 
gene expression. Such homopyrimidine oligonucleotides bind to the major groove at homopurine:homopyrimidine sequences. 
Thus, both types of sequences based on the sequences of the present invention that are required for proliferation are 
contemplated for use as antibiotic compound templates. 

The antisense oligonucleotides of this example employ the identified sequences of the present invention to induce 
bacterial cell death or at least bacterial stasis by inhibiting target gene translation. Antisense oligonucleotides containing 
from about 8 to 40 bases of the sequences of the present invention have sufficient complementary to form a duplex with 
the target sequence under physiological conditions. 

To kill bacterial cells or inhibit their growth, the antisense oligonucleotides are applied to the bacteria or to the 
target cells under conditions that facilitate their uptake. These conditions include sufficient incubation times of cells and 
oligonucleotides so that the antisense oligonucleotides are taken up by the cells. In one embodiment, an incubation period 
of 7-10 days is sufficient to kill bacteria in a sample. An optimum concentration of antisense oligonucleotides is selected 
for use. 

The concentration of antisense oligonucleotides to be used can vary depending on the type of bacteria sought to 
be controlled, the nature of the antisense oligonucleotide to be used, and the relative toxicity of the antisense 
oligonucleotide to the desired cells in the treated culture. Antisense oligonucleotides can be introduced to cell samples at a 
number of different concentrations preferably between 1x10 l0 M to 1x10 4 M. Once the minimum concentration that can 
adequately control gene expression is identified, the optimized dose is translated into a dosage suitable for use in vivo. For 
example, an inhibiting concentration in culture of 1x10 7 translates into a dose of approximately 0.6 mg/kg body weight. Levels 
of oligonucleotide approaching 100 mg/kg body weight or higher may be possible after testing the toxicity of the oligonucleotide 
in laboratory animals. It is additionally contemplated that cells from the subject are removed, treated with the antisense 
oligonucleotide, and reintroduced into the subject. This range is merely illustrative and one of skill in the art are able to 
determine the optimal concentration to be used in a given case. 



-74- 



WO 00/44906 



PCT/US00/02200 



After the bacterial cells have been killed or controlled in a desired culture, the desired cell population may be used 
for other purposes. 

EXAMPLE 41 

The following example demonstrates the ability of an £ a?// antisense oligonucleotide to act as a bactericidal or 
bacteriostatic agent to treat a contaminated cell culture system. The application of the antisense oligonucleotides of the 
present invention are thought to inhibit the translation of bacterial gene products required for proliferation. 

The antisense oligonucleotide of this example corresponds to a 30 base phophorothioate modified 
oligodeoxynucelotide complementary to a nucleic acid involved in proliferation, such as Molecule Number EcXAOOI. A 
sense oligodeoxynucelotide complementary to the antisense sequence is synthesized and used as a control. The 
oligonucleotides are synthesized and purified according to the procedures of Matsukura, et al., Gene 71343 (1988). The 
test oligonucleotides are dissolved in a small volume of autoclaved water and added to culture medium to make a 100 
micromolar stock solution. 

Human bone marrow cells are obtained from the peripheral blood of two patients and cultured according standard 
procedures well known in the art. The culture is contaminated with the K-12 strain of £ coli and incubated at 37°C 
overnight to establish bacterial infection. 

The control and antisense oligonucleotide containing solutions are added to the contaminated cultures and 
monitored for bacterial growth. After a 10 hour incubation of culture and oligonucleotides, samples from the control and 
experimental cultures are drawn and analyzed for the translation of the target bacterial gene using standard 
microbiological techniques well known in the art. The target £ atfgene is found to be translated in the control culture 
treated with the control oligonucleotide, however, translation of the target gene in the experimental culture treated with 
the antisense oligonucleotide of the present invention is not detected or reduced. 

EXAMPLE 42 

A subject suffering from an £ coli infection is treated with the antisense oligonucleotide preparation of Example 
39. The antisense oligonucleotide is provided in a pharmaceutical^ acceptable carrier at a concentration effective to 
inhibit the translation of the target gene. The present subject is treated with a concentration of antisense oligonucleotide 
sufficient to achieve a blood concentration of about 100 micromolar. The patient receives daily injections of antisense 
oligonucleotide to maintain this concentration for a period of 1 week. At the end of the week a blood sample is drawn and 
analyzed for the presence or absence using standard techniques well known in the art. There is no detectable evidence of 
E. coli and the treatment is terminated. 

EXAMPLE 43 

Preparation and use of Triple Helix Probes 
The sequences of microorganism genes required for proliferation of the present invention are scanned to identify 10- 
mer to 2Q.mer homopyrimidine or homopurine stretches that could be used in triple heKx based strategies for inhibiting gene 
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expression. Following identification of candidate homopyrimidine or homopurine stretches, their efficiency in inhibiting gene 
expression is assessed by introducing varying amounts of ofigonucieotides containing the candidate sequences into a population 
of bacterial cells that normally express the target gene. The oligonucleotides may be prepared on an oligonucleotide synthesizer 
or they may be purchased commercially from a company specializing in custom oligonucleotide synthesis, such as GENSET, 
Paris, France. 

The oligonucleotides can be introduced into the cells using a variety of methods known to those skilled in the art, 
including but not limited to calcium phosphate precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection or 
native uptake. 

Treated cells are monitored for a reduction in proliferation using techniques such as monitoring growth levels as 
compared to untreated cells using optical density measurements. The oligonucleotides that are effective in inhibiting gene 
expression in cultured cells can then be introduced in vivo using the techniques well known in that art at a dosage level shown 
to be effective. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be replaced with alpha anomers to 
render the oligonucleotide more resistant to nucleases. Further, an intercalating agent such as ethidium bromide, or the like, can 
be attached to the 3 end of the alpha oligonucleotide to stabilize the triple helix. For information on the generation of 
oligonucleotides suitable for triple helix formation see Griffin et al. (Science 245:967-971 (1989), which is hereby incorporated 
by this reference). 

EXAMPLE 44 

Identification of Bacterial Strains from Isolated Specimens bv PCR 
Classical bacteriological methods for the detection of various bacterial species are time consuming and costly. 
These methods include growing the bacteria isolated from a subject in specialized media, cultivation on selective agar 
media, followed by a set of confirmation assays that can take from 8 to 10 days or longer to complete. Use of the 
identified sequences of the present invention provides a method to dramatically reduce the time necessary to detect and 
identify specific bacterial species present in a sample. 

In one exemplary method, bacteria are grown in enriched media and DNA samples are isolated from specimens of f for 
example, blood, urine, stool, saliva or central nervous system fluid by conventional methods. A panel of PCR primers based on 
identified sequences unique to various species of microorganisms are then utilized in accordance with Example 12 to amplify 
DNA of approximately 100*200 bases in length from the specimen. A separate PCR reaction is set up for each pair of PCR 
primers and after the PCR reaction is complete, the reaction mixtures are assayed for the presence of PCR product. The 
presence or absence of bacteria from the species to which the PCR primer pairs belong is determined by the presence or 
absence of a PCR product in the various test PCR reaction tubes. 

Although the PCR reaction is used to assay the isolated sample for the presence of various bacterial species, other 
assays such as the Southern blot hybridization are also contemplated. 
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WHAT IS CLAIMED IS : 

1. A purified or isolated nucleic acid sequence consisting essentially of one of SEQ 10 NOs: 405485, 
wherein said nucleic acid inhibits microorganism proliferation. 

2. The nucleic acid sequence of Claim 1, wherein said nucleic acid sequence is complementary to at least 
a portion of a coding sequence of a gene whose expression is required for microorganism proliferation. 

3. The nucleic acid sequence of Claims 1 or 2, wherein said nucleic acid comprises a fragment of one of 
SEQ ID NOs. 405485, said fragment selected from the group consisting of fragments comprising at least 10, at feast 20, 
at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 405485. 

4. The nucleic acid sequence of Claim 3, wherein said nucleic acid sequence is complementary to a coding 
sequence of a gene whose expression is required for microorganism proliferation. 

5. A vector comprising a promoter operably linked to a nucleic acid comprising a sequence selected from 
the group consisting of SEQ ID NOs. 405485. 

6. The vector of Claim 5, wherein said promoter is active in an organism selected from the group 
consisting of Escherichia coll Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, 
Neisseria gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella 
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigates, Klebsiella 
pneumoniae. Salmonella typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium 
tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus anthracis. Yersinia pestis, Clostridium botulinum, 
Campylobacter jejuni, Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the 
above species. 

7. A host ceil containing the vector of Claim 5 or Claim 6. 

8. A purified or isolated nucleic acid consisting essentially of the coding sequence of one of SEQ ID NOs: 
82-88, 90-242. 

9. A fragment of the nucleic acid of Claim 8, said fragment comprising at least 10, at least 20, at least 
25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90-242. 

1 0. A vector comprising a promoter operably linked to the nucleic acid of Claim 8 or Claim 9. 

11. A purified or isolated nucleic acid comprising a nucleic acid sequence complementary to at least a 
portion of an intragenic sequence, intergenic sequence, sequences spanning at least a portion of two or more genes, 5' 
noncoding region, or 3' noncoding region within an operon encoding a polypeptide comprising a sequence selected from the 
group consisting of SEQ ID NOs: 243-357, 359-398. 

12. A purified or isolated nucleic acid comprising a nucleic acid having at least 70% homology to a 
sequence selected from the group consisting of SEQ ID NOs 405485, 82-88, 90-242 or the sequences complementary 
thereto as determined using BLASTN version 2.0 with the default parameters. 
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13. The nucleic acid of Claim 12, wherein said nucleic acid is from an organism selected from the group 
consisting of Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, 
Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoiormans, Aspergillus fumigatus, Klebsiella pneumoniae, 
Salmonella typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter 
jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above 
species. 

14. A purified or isolated nucleic acid consisting essentially of a nucleic acid encoding a polypeptide having 
a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

15. A vector comprising a promoter operably linked to a nucleic acid encoding a polypeptide having a 
sequence selected from the group consisting of SEQ ID NOs.: 243-357, 359*398. 

1 6. A host cell containing the vector of Claim 1 5. 

1 7. A purified or isolated polypeptide comprising the sequence of one of SEQ ID NOs: 243-357, 359-398. 

18. A purified or isolated polypeptide comprising a fragment of one of the polypeptides of SEQ ID NOs. 
243-357, 359-398, said fragment selected from the group consisting of fragments comprising at least 5, at least 10, at 
least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive amino acids of one of the 
polypeptides of SEQ ID NOs.: 243-357, 359-398. 

19. An antibody capable of specifically binding the polypeptide of Claim 17 or Claim 18. 

20. A method of producing a polypeptide, comprising introducing a vector comprising a promoter operably 
linked to a nucleic acid encoding a polypeptide having a sequence selected from the group consisting of SEQ ID NOs. 243- 
357, 359-398into a cell. 

21 . The method of Claim 20, further comprising the step of isolating said protein. 

22. A method of inhibiting proliferation comprising inhibiting the activity or reducing the amount of a 
polypeptide having a sequence selected from the group consisting of SEQ ID NOs. 243-357, 359-398or inhibiting the 
activity or reducing the amount of a nucleic acid encoding said polypeptide. 

23. A method for identifying compounds which influence the activity of a polypeptide required for 
proliferation comprising: 

contacting a polypeptide having a sequence selected from the group consisting of 243-357, 359-398 
with a candidate compound; and 

determining whether said compound influences the activity of said polypeptide. 

24. The method of Claim 23, wherein said activity is an enzymatic activity. 

25. The method of Claim 23, wherein said activity is a carbon compound catabolism activity. 
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26. The method of Claim 23, wherein said activity is a biosynthetic activity. 

27. The method of Claim 23, wherein said activity is a transporter activity. 

28. The method of Claim 23, wherein said activity is a transcriptional activity. 

29. The method of Claim 23, wherein said activity is a DNA replication activity. 
5 30. The method of Claim 23, wherein said activity is a cell division activity. 

31. A method for assaying compounds for the ability to reduce the activity or level of a polypeptide 
required for proliferation, comprising: 

providing a target, wherein said target comprises the coding sequence of a sequence selected from the 
group consisting of SEQ ID NOs. 82-88, 90-242; 
1 0 contacting said target with a candidate compound; and 

measuring an activity of said target. 

32. The method of Claim 31, wherein said target is a messenger RNA molecule transcribed from a coding 
region of one of SEQ ID. NOs.: 82-88, 90-242 and said activity is translation of said messenger RNA. 

33. The method of Claim 32, wherein said target is a coding region of one of SEQ ID. NQs. 82-88, 90-242 
15 and said activity is transcription of said messenger RNA. 

34. A compound identified using the method of Claim 31. 

35. A method for identifying compounds which reduce the activity or level of a gene product required for 
cell proliferation comprising the steps of: 

expressing an antisense nucleic acid against a nucleic acid encoding said gene product in a cell to 
20 reduce the activity or amount of said gene product in said cell, thereby producing a sensitized cell; 

contacting said sensitized ceil with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than 
said compound inhibits the growth of a nonsensitized cell. ' 

36. The method of Claim 35, wherein said cell is selected from the group consisting of bacterial cells, 
25 fungal cells, plant ceils, and animal cells. 

37. The method of Claim 36, wherein said cell is an E coli cell. 

38. The method of Claim 36, wherein said cell is from an organism selected from the group consisting of 
Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 

30 cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, 

Salmonella paratyphi, Salmonella choferasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum, Bacillus anthracis. Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, and Chlamydia 
trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above species. 
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39. The method of Claim 35, wherein said antisense nucleic acid is transcribed from an inducible promoter. 

40. The method of Claim 39, further comprising the step of contacting said cell with a concentration of 
inducer which induces said antisense nucleic acid to a sublethal level. 

41. The method of Claim 40, wherein said sub lethal concentration of said inducer is such that growth 
inhibition is 8% or more, 

42. The method of Claim 40, wherein said inducer is isopropyl-1-thio-p-D-galactoside. 

43. The method of Claim 35, wherein growth inhibition is measured by monitoring optical density of a 
culture growth solution. 

44. The method of Claim 35, wherein said gene product is a polypeptide. 

45. The method of Claim 35, wherein said gene product is an RNA. 

46. The method of Claim 44, wherein said gene product comprises a polypeptide having a sequence 
selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

47. A compound identified using the method of Claim 35. 

48. A method for inhibiting cellular proliferation comprising introducing a compound with activity against a 
gene corresponding to one of SEQ ID NOs.: 82 88, 90-242 or with activity against the product of said gene into a 
population of cells expressing a gene. 

49. The method of Claim 48, wherein said compound is an antisense oligonucleotide comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 405-485, or a proliferation-inhibiting portion thereof. 

50. The method of Claim 49, wherein said proliferation inhibiting portion of one of SEQ ID NOs. 405485 is 
a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of 
one of SEQ ID NOs: 405485. 

51. The method of Claim 48, wherein said compound is a triple helix oligonucleotide. 

52. A preparation comprising an effective concentration of an antisense oligonucleotide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 405485, or a proliferation-inhibiting portion thereof in a 
pharmaceutical^ acceptable carrier. 

53. The preparation of Claim 52, wherein said proliferation-inhibiting portion of one of SEQ ID NOs. 405- 
485 comprises at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of 
SEQ ID NOs: 405485. 

54. A method for inhibiting the expression of a gene in an operon required for proliferation comprising 
contacting a cell in a cell population with an antisense nucleic acid, said cell expressing a gene corresponding to one of 
SEQ ID NOs.: 82-88, 90-242, wherein said antisense nucleic acid comprises at least a proliferation-inhibiting portion of 
said operon in an antisense orientation that is effective in inhibiting expression of said gene. 
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55. The method of Claim 54, wherein said antisense nucleic acid is complementary to a sequence of a gene 
comprising one or more of SEQ 10 NOs.: 82-88, 90-241 

56. The method of Claim 54, wherein said antisense nucleic acid is a sequence of one of SEQ ID N0$.: 405- 
485, or a portion thereof. 

57. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a plasmid which expresses said antisense nucleic acid into said cell population. 

58. The method of Claim 54, wherein said ceil is contacted with said antisense nucleic acid by introducing 
a phage which expresses said antisense nucleic acid into said cell population. 

59. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a sequence encoding said antisense nucleic acid into the chromosome of said cell into said cell population. 

60. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a retron which expresses said antisense nucleic acid into said cell population. 

61. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a ribozyme into said cell-population, wherein a binding portion of said ribozyme is complementary to said antisense 
oligonucleotide. 

62. The method of Claim 54, wherein said ceil is contacted with said antisense nucleic acid by introducing 
a liposome comprising said antisense oligonucleotide into said cell. 

63. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by 
elect rop oration. 

64. The method of Claim 54, wherein said antisense nucleic acid is a fragment comprising at least 10 r at 
least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90-242. 

65. The method of Claim 54 wherein said antisense nucleic acid is an oligonucleotide. 

66. A method for identifying bacterial strains comprising the steps of: 
providing a sample containing a bacterial species; and 

identifying a bacterial species using a species specific probe having a sequence selected from the group 
consisting of SEQ ID NOs. 405485, 82-88, 90-242. 

67. A method for identifying a gene in a microorganism required for proliferation comprising: 

(a| identifying an inhibitory nucleic acid which inhibits the activity of a gene or gene product 
required for proliferation in a first microorganism; 

(b) contacting a second microorganism with said inhibitory nucleic acid; 

(c) determining whether said inhibitory nucleic acid. .from said first microorganism- inhibits 
proliferation of said second microorganism; and 
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(d) identifying the gene in said second microorganism which is inhibited by said inhibitory nucleic 

acid. 

68. A method for assaying a compound for the ability to inhibit proliferation of a microorganism comprising: 

(a) identifying a gene or gene product required for proliferation in a first microorganism; 

(b) identifying a homolog of said gene or gene product in a second microorganism; 

(c) identifying an inhibitory nucleic acid sequence which inhibits the activity of said homolog in 
said second microorgansim; 

(d) contacting said second microorganism with a proliferation-inhibiting amount of said inhibitory 
nucleic acid, thus sensitizing said second microorganism; 

(e) contacting the sensitized microorganism of step (d) with a compound; and 

(f) determining whether said compound inhibits proliferation of said sensitized microorganism to 
a greater extent than said compound inhibits proliferation of a nonsensitized microorganism. 

69. The method of Claim 68, wherein said step of identifying a gene involved in proliferation in a first 
microorganism comprises: 

introducing a nucleic acid comprising a random genomic fragment from said first microorganism 
operably linked to a promoter wherein said random genomic fragment is in the antisense orientation; and 

comparing the proliferation of said first microorganism transcribing a first level of said random genomic 
fragment to the proliferation of said first microorganism transcribing a lower level of said random genomic 
fragment, wherein a difference in proliferation indicates that said random genomic fragment comprises a gene 
involved in proliferation. 

70. The method of Claim 69, wherein said step of identifying a homolog of said gene in a second 
microorganism comprises identifying a homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a 
database using an algorithm selected from the group consisting of BLASTN version 2.0 with the default parameters and 
FASTA version 3.0t78 algorithm with the default parameters. 

71. The method of Claim 69, wherein said step of identifying a homolog of said gene in a second 
microorganism comprises identifying a homologous nucleic acid or a nucleic acid encoding a homologous polypeptide by 
identifying nucleic acids which hybridize to said first gene. 

72. The method of Claim 69, wherein the step of identifying a homolog of said gene in a second 
microorganism comprises expressing a nucleic acid which inhibits the proliferation of said first microorganism in said 
second microorganism. 

73. The method of Claim 69, wherein said inhibitory nucleic acid is an antisense nucleic acid. 

74. The method of Claim 69, wherein said inhibitory nucleic acid comprises an antisense nucleic acid to a 
portion of said homolog. 
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75. The method of Claim 69, wherein said inhibitory nucleic acid comprises an antisense nucleic acid to a 
portion of the operon encoding said homolog. 

76. The method of Claim 69, wherein the step of contacting the second microorganism with a proliferation- 
inhibiting amount of said nucleic acid sequence comprises directly contacting said second microorganism with said nucleic 
acid. 

77. The method of Claim 69, wherein the step of contacting the second microorganism with a proliferation- 
inhibiting amount of said nucleic acid sequence comprises expressing an antisense nucleic acid to said homolog in said 
second microorganism. 

78. A compound identified using the method of Claim 68. 

79. A method of assaying a compound for the ability to inhibit proliferation comprising: 

(a) identifying an inhibitory nucleic acid sequence which inhibits the activity of a gene or gene 
product required for proliferation in a first microorgansim; 

(b) contacting a second microorganism with a proliferation-inhibiting amount of said inhibitory 
nucleic acid, thus sensitizing said second microorganism; 

(c) contacting the proliferation-inhibited microorganism of step (b) with a compound; and 

{d> determining whether said compound inhibits proliferation of said sensitized second 
microorganism to a greater extent than said compound inhibits proliferation of a nonsensitized second microorganism. 

80. The method of Claim 79, wherein said inhibitory nucleic acid is an antisense nucleic acid which inhibits 
the proliferation of said first microorganism. 

81. The method of Claim 79, wherein said inhibitory nucleic acid comprises a portion of an antisense 
nucleic acid which inhibits the proliferation of said first microorganism. 

82. The method of Claim 79, wherein said inhibitory nucleic acid comprises an antisense molecule against 
the entire coding region of the gene involved in proliferation of the first microorganism. 

83. The method of Claim 79, wherein said inhibitory nucleic acid comprises an antisense nucleic acid to a 
portion of the operon encoding the gene involved in proliferation of the first microorganism. 

84. A compound identified using the method of Claim 79. 

85. A method for assaying compounds for activity against a biological pathway required for proliferation 
comprising: 

sensitizing a cell by expressing an antisense nucleic acid against a nucleic acid encoding a gene product 
required for proliferation in a cell to reduce the activity or amount of said gene product; 
contacting the sensitized cell with a compound; and . .. 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than 
said compound inhibits the growth of an nonsensitized cell. 
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86. The method of Claim 85, wherein said cell is selected from the group consisting of bacterial cells, 
fungal cells, plant cells, and animal cells. 

87. The method of Claim 86, wherein said cell is an £ coli cell 

88 The method of Claim 85, wherein said cell is from an organism selected from the group consisting of 
5 Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 

Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, 
Salmonella paratyphi. Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, and Chlamydia 
1 0 trachomatus. Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

89. The method of Claim 85, wherein said antisense nucleic acid is transcribed from an inducible promoter. 

90. The method of Claim 89, further comprising contacting the cell with an agent which induces expression 
of said antisense nucleic acid from said inducible promoter, wherein said antisense nucleic acid is expressed at a sublethal 
level. 

15 91 . The method of Claim 90, wherein said sublethal level of said antisense nucleic acid inhibits proliferation 

by 8% or more. 

92. The method of Claim 90, wherein said agent is isopropyl-1-thio-P-D-galactoside (IPTG). 

93. The method of Claim 91, wherein inhibition of proliferation is measured by monitoring the optical 
density of a liquid culture. 

20 94. The method of Claim 85, wherein said gene product comprises a polypeptide having a sequence 

selected from the group consisting of SEQ 10 NOs: 243*357, 359-398. 

95. A compound identified using the method of Claim 85. 

96. A method for assaying a compound for the ability to inhibit cellular proliferation comprising: 
contacting a ceil with an agent which reduces the activity or level of a gene product required for 

25 proliferation of said cell; 

contacting said cell with said compound; and 

determining whether said compound reduces proliferation to a greater extent than said compound 
reduces proliferation of cells which have not been contacted with said agent. 

97. The method of Claim 96, wherein said agent which reduces the activity or level of a gene product 
30 required for proliferation of said cell comprises an antisense nucleic acid to a gene or operon required for proliferation. 

98. The method of Claim 96, wherein said agent which reduces the activity or level of a gene product 
required for proliferation of said cell comprises an antibiotic. 



•84- 



WO 00/44906 



PCT/US00/02200 



99. The method of Claim 96, wherein said cell contains a temperature sensitive mutation which reduces the 
activity or level of said gene product required for proliferation of said ceil. 

100. The method of Claim 99, wherein said antisense nucleic add is directed against the nucleic acid 
encoding the same functional domain of said gene product required for proliferation of said cell to which said antisense 
nucleic acid is directed. 

101. The method of Claim 99, wherein said antisense nucleic acid is directed against the nucleic acid a 
different functional domain of said gene product required for proliferation of said cell than the functional domain to which 
said antisense nucleic acid is directed. 

102. A compound identified using the method of Claim 96. 

101 A method for identifying the pathway in which a proliferation-required nudeic add or its gene product 
lies comprising: 

expressing a sublethal level of an antisense nucleic acid directed against said proliferation-required nucleic acid in 

a cell; 

contacting said cell with an antibiotic, wherein the biological pathway on which said antibiotic acts is known; 

and 

determining whether said ceil has a substantially greater sensitivity to said antibiotic than a cell which does not 
express said sublethal level of said antisense nucleic acid. 

1 04. A method for determining the pathway on which a test compound acts comprising: 

(a) expressing a sublethal level of an antisense nucleic acid directed against a proliferation-required nucleic acid in 
a cell, wherein the biological pathway in which said proliferation-required nucleic acid lies is known, 

(b) contacting said ceil with said test compound; and 

(c) determining whether said cell has a substantially greater sensitivity to said test compound than a cell which 
does not express said sublethal level of said antisense nucleic acid. 

1 05. The method of Claim 1 04, further comprising: 

(d) expressing a sublethal level of a second antisense nucleic acid directed against a second 
proliferation-required nucleic acid in said cell, wherein said second proliferation-required nucleic acid is in a 
different biological pathway than said proliferation-required nucleic acid in step (a); and 

(e) determining whether said cell has a substantially greater sensitivity to said test compound than a 
cell which does not express said sublethal level of said second antisense nucleic acid. 

106. A purified or isolated nucleic acid consisting essentially of one of SEQ ID NOs: 358, 399402. 

1 07. A compound identified using the method of Claim 23. 

108. A compound which interacts with the gene or gene product of a nucleic acid comprising a sequence of 
one of SEQ ID NOs: 82-88, 90-242 to inhibit proliferation. 
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109. A compound which interacts with a polypeptide comprising one of SEQ ID NOs. 243-357, 359-398 to 
inhibit proliferation. 

110. A compound which interacts with a nucleic acid comprising one of SEQ ID NOs: 358, 399-402 to inhibit 
proliferation. 
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SEQUENCE LISTING 

<110> ELITRA PHARMACEUTICALS , INC . 
Zyskind, Judith 
Ohlsen, Kara L. 
Trawick, John 
Forsyth, R. Allyn 
Froelich, Jamie M. 
Carr, Grant J. 
Yamamoto, Robert T. 
Xu, H. Howard 



<120> GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION IN 
ESCHERICHIA COLI 

<130> ELITRA. O01VPC 

<160> 485 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 159 
<212> DNA 
<213> E. Coli 

<400> 1 

caggtggtat ggaaacccaa aatggagacg ggaagctgaa ccagatagtt actggaggtg 60 

atcaccagca gatgaaataa cgataaccag aacaacgcct tatagcgttg agtttgcgag 120 

aaaacgttca tattgtacct ttttgattaa ccattgggg ' J 159 

<210> 2 
<211> 696 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 

<222> (1) (696) 

<223> n - A, T, C or G 

<400> 2 

gattacatca agcgcgcggt gggtttaccg ggcgataaag tcacttacga tccggtctca 60 

aaagagctga cgattcaacc gggatgcagt tccggccagg cgtgtgaaaa cgcgctgccg 120 

gtcacctact caaacgtgga accgagcgat ttcgttcaga ccttctcacg ccgtaatggt 180 

ggggaagcga ccagcggatt ctttgaagtg ccgaaaaacg aaaccaaaga aaatggaatt 240 

cgtctttccg agcgtaaaga gacactgggt gatgtgacgc accgcattct gacagtgccg 300 

attgcgcagg atcaggtggg gatgtattac cagcagccag ggcaacaact ggcaacctgg 360 

attgttcctc cgggacaata cttcatgatg ggcgacaacc gcgacaacag cgcggacagc 420 

cgttactggg gctttgtgcc ngaagcgaat ctggtcggtc nggcaacggc tatctggatg 480 

aacttcgata accaagaagg cgaatggccg aatggtctgc cctaantcgc attggcgnnt 540 

ccnttaatan ccacttcctt cnctttgtcc ccttatggca acacttaatt tattntaaan 600 

taatcncccg tggctnacaa atccccgcct tttnttaaaa atttccccna anttaaggtt 660 

ggcctccagt tgcccgnccc aaacactttg gncccc 595 

<210> 3 
<211> 681 
<212> DNA 
<213> E. Coli 
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<220> 

<221> misc_feature 
<222> (1) . . . (681) 
<223> n = A,T,C or G 



<400> 3 

ctgcagggta atgtcgccat taaactggcg caggcagcca aagagttgct ccgcttctac 60 

ccagtcggca gcgacaactt gcgttaaagt cgcaaaatta tcatctgcac tcactgcgtg 120 

acgtaagcgg atggagtggc cggaaacctc atagtgaccg cccaccagtt ggcctgcatc 180 

gctttgtagc gtacgcgcgg cattggcaat aagattcaga tactcagact cttccggggc 240 

cttcgccagc ataaaagagg aggatgctcg cgtatgcagc aactgctcca gcgcaaattg 300 

cagccgcggt tgagtatcac tgaataaagg atcgttttcg tcaatcaaat gtggctgagc 360 

aaatatttcc tgatagctat cggtatcagg aaccaggtca cgccatgcaa gtttcgtaat 420 

ggtcaaagtt gatgtttttt agtctgttgt caaagccgcn attataccng taaccggcac 480 

tacagcacac gtagaaagca cccgacaata ctcctggcat gggcgttaaa gctcacagga 540 

tggagatctt ttcttcactg gcctaaaaag ctgatattct gtaaagagtt acacngtaac 600 

attgagatcg ctatgaaata tcaacaactt ggaaaatctt gnaaagcngg ttggaaaatg 660 

gaaagtatct ggttaagaag c 681 



<210> 4 
<211> 289 
<212> DNA 
<213> E. Coli 



<400> 4 

ggcagaattt tacgctgacc aatgacgcga cgacgtggca tggaaatact ccgttgttaa 60 

ttcaggattg tccaaaactc tacgagttta gtttgacatt taagttaaaa cgtttggcct 120 

tacttaacgg agaaccatta agccttagga cgcttcacgc catacttgga acgagcctgc 180 

ttacggtctt taacgccgga gcagtcaagc gcaccacgta cggtgtggta acgaacaccc 240 

gggaggtctt taacacgacc gtcacggatc aggatcacgg agtgctcct 289 

<210> 5 
<211> 815 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . ! . (815) 
<223> n = A, T,C or G 



<400> 5 

gggagcttac atcagtaagt gaccgggatg agcgagcgaa gataacgcat ctgcggcgcg 60 

aaatatgaag ggggagagcc cttatagacc aggtagtaca cgtttggtta gggggcctgc 120 

atatggcccc ctttttcact tttatatctg tgcggtttaa tgccgggcag atcacatctc 180 

cgaggatttt agaatggctg aaattaccgc atccctggta aaagagctgc gtgagcgtac 240 

tggcgcaggc atgatggatt gcaaaaaagc actgactgaa gctaacggcg acatcgagct 300 

ggcaatcgaa aacatgcgta agtccggtgc tattaaagca gcgaaaaaag caggcaacgt 360 

tgctgctgac ggcgtgatca aaaccaaaat cgacggcaac tacggcatca ttctggaagt 420 

taactgccag actgacttcg ttgcaaaaga cgctggtttc caggcgttcg cagacaaagt 480 

tctggacgca gctgttgctg gcaaaatcac tgacgttgaa gttctgaaag cacagttcga 540 

agaagaacgt gttgcgctgg tagcgaaaat tggtgaaaac atcaacattc gccgcgttgc 600 

tgcgctggaa ggcgacgttc tgggttctta tcagcacggt gcgcgtatcg gccgttctgg 660 

ttgctgctaa aagcgctgac gaagaactgg ttaaacacat cgttttgacc tttgttgcaa 720 

gccaagccag aattcagaga aactttccgc ttcaccggag gtcccaccca cangganccc 780 

cgattttntc agcatggtgg tcttcctncg gagtt 815 



<210> 6 
<211> 403 
<212> CNA 
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<213> E . Coli 

<400> 6 
caacactatt ttgttgaccg 
ttaaaccatt tcattgcgat 
cgatagaaac aagcattgaa 
ccttacagcg caaaaaggct 
gctgcaaccg gaagggttgg 
ttttttcagt gcttctgcgt 
ttctaccagg tctttagctt 

<210> 7 
<211> 149 
<212> DNA 
<213> E. Coli 

<400> 7 
gagctttttt cagtgcttct 
cagattctac caggtcttta 
tgataacagc aactttgtta 

<210> 8 

<211> 742 

<212> DNA 

<213> E . Coli 

<220> 

<221> misc_f eature 

<222> (1) . . . (742) 

<223> n « A,T,C or G 

<400> 8 

ccatctgtcc attgagcgga cagtttgtgc aacactattt tgttgaccgg aaaatggaac 60 

actttccgca atgcctgttg ctatcacgct taaaccattt cattgcgatt tacacagaac 120 

ggacgtcctg tcgcagtata ttaagtcgtc gatagaaaca agcattgaaa ggcacagcag 180 

tagtcaaaca gtgtgaaacg ctactggcgc cttacagcgc aaaaaggctg gtgactaaaa 240 

agtcaccagc catcagcctg atttctcagg ctgcaaccgg aagggttggc ttatttaact 300 

tcaacttcag cgccagcttc ttccagagct tttttcagtg cttctgcgtc gtctttgctc 360 

acgccttctt tcagagcagc cggtgcagat tctaccaggt ctttagcttc tttcagaccc 420 

aggccagttg cgccacgtac tgctttgata acagcaactt tgttagcgcc agcagctttc 480 

agaattacgt cgaattcagt tntttcttca gcagcttcaa ccgggccagc agctacagct 540 

acagcagcag caagcggaaa caccgaattt ttcttccatt gcagagatca gttctacaac 600 

cgtccattac agacatagct gcaactgctt caatgatttt gatctttagt ggatagacat 660 

ttaaattgtt cctgaattat caagaaataa gtnttatacg taagccgaaa tgcgttaaaa 720 

aagataactg ngattaaagc ag 742 

<210> 9 
<211> 421 
<212> DNA 
<213> E. Coli 

<400> 9 
agtagtcaaa cagtgtgaaa 
aaagtcacca gccatcagcc 
cttcaacttc agcgccagct 
tcacgccttc tttcagagca 
ccaggccagt tgcgccacgt 
tcagaattac gtcgaattca 
ctacagcagc agcagcggaa 
c 



gaaaatggaa 
ttacacagaa 
aggcacagca 
ggtgactaaa 
cttatttaac 
cgtctttgct 
ctttcagacc 



cactttccgc 
cggacgtcct 
gtagtcaaac 
aagtcaccag 
ttcaacttca 
cacgccttct 
caggccagtt 



aatgcctgtt 
gtcgcagtat 
agtgtgaaac 
ccatcagcct 
gcgccagctt 
ttcagagcag 
gcg 



gctatcacgc 
attaagtcgt 
gctactggcg 
gatttctcag 
cttccagagc 
ccggtgcaga 



60 
120 
180 
240 
300 
360 
403 



gcgtcgtctt tgctcacgcc ttctttcaga gcagccggtg 60 
gcttctttca gacccaggcc agttgcgcca cgtactgctt 120 
gcgccagca 149 



cgctactggc gccttacagc gcaaaaaggc tggtgactaa 60 

tgatttctca ggctgcaacc ggaagggttg gcttatttaa 120 

tcttccagag cttttttcag tgcttctgcg tcgtctttgc 180 

gccggtgcag attctaccag gtctttagct tctttcagac 240 

actgctttga taacagcaac tttgttagcg ccagcagctt 300 

gttttttctt cagcagcttc aaccgggcca gcagctacag 360 

acaccgaatt tttcttccat tgcagagatc agttctacaa 420 
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<210> 10 
<211> 126 
<212> DNA 
<213> E. Coli 

<400> 10 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcag agcagccggt 60 

gcagattcta ccaggtcttt agcttctttc agacccaggc cagttgcgcc acgtactgct 120 

ttgata " 126 

<210> 11 
<211> 262 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1)...(262) 
<223> n — A,T,C or G 

<400> 11 

ctgcaaccgg aagggttggc ttatttaact tcaacttcag cgccagcttc ttccagagct 60 

tttttcagtg cttctgcgtc gtctttgctc acgccttctt tcagagcagc cgntgcagat 120 

tctaccaggt ctttagcttc tttcagaccc aggccagttg cgccacgtac tgctttgata 180 

acagcaactt tgttagcgcc agcagctttc agaattacgt cgaattcagt tttttcttca 240 

gcagcrtcaa ccgggccagc ag 262 

<210> 12 
<211> 202 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (202) 
<223> n = A,T,C or G 

<400> 12 

gcgca-accc tgcagcatcg gcccgatgga gatcaggtcg gcagaacgct gtaccgcttt 60 

gtagg-ggtg ttaccggtgn tcagatccgg gaagatgaac acggtagcgc gacctgcaac 120 

cggacagttc ggcgctttgg attncgcaac gtcagccatt accgcagcgt cgtactgcag 180 

cggaccggcg atcatcaggt ca 202 

<210> 13 

<211> 261 

<212> DNA 

<213> E. Coli 

<400> 13 

tctaggagta agaatagctt caaattcagc agttgacagt ggcataaacg taactggtga 60 

cttttgcccg gcatgacgcc gggctttttt tattattccg tgacttccag cgtagtgaag 120 

gcaaacttct cgccatcaaa tagcccctga ctggttagtt ttagcgcggg gatcactggc 180 

agagaaagaa acgccatctg aataaacggc tcatcgggta acggaccgca ttcacgggcg 240 

gcggc-ttca aggcgtcaat t 261 

<210> 14 

<211> 224 

<212> DNA 

<213> E. Coli 



<400> 14 
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ttcttttttt cgtcaacggt gtccagaatc attttattta cctcggggta cttatgctga 60 

tttttattat tatggggaag gtgttattta tgagtttcat ttatgccgta acgacaatga 120 

actcgggaat tagtataagc agcgcgagaa taataatcat tgtgcaaatg ctaatttaat 180 

taatactatt taaatattat tttgagcata tgcacataag gttg 224 

<210> 15 
<211> 232 
<212> DNA 
<213> E. Coli 

<400> 15 

aattcccttc tttttttcgt caacggtgtc cagaatcatt ttatttacct cgggtactta 60 

tgctgatttt tattattatg gggaaggtgt tatttatgag tttcatttat gccgtaacga 120 

caatgaactc gggaattagt ataagcagcg cgagaataat aatcattgtg caaatgctaa 180 

tttaattaat actatttaaa tattattttg agcatatgca cataaggttg gg 232 

<210> 16 
<211> 212 
<212> DNA 
<213> E. Coli 

<400> 16 

aatagcgggt atgcacgcct ttcttttttt cgtcaacggt gtccagaatc attttattta 60 

cctcgggtac ttatgctgat ttttattatt atggggaagg tgttatttat gagtttcatt 120 

tatgccgtaa cgacaatgaa ctcgggaatt agtataagca gcgcgagaat aataatcatt 180 

gtgcaaatgc taatttaatt aatactattt aa 212 

<210> 17 

<211> 433 

<212> DNA 

<213> E. Coli 

<400> 17 

ccttgtaaat tatcgcccgt ggcataaaaa ctgcgtccaa acgccgtctt tgccagcagc 60 

caggccataa atgccaccag aattatcgtc aaccaaccaa ttgctgaaac gccaagcagc 120 

agcggggcgg agagctgttt cagttcggcg ggtaaccctt caatccattt gccgccagtc 180 

cacagcaaca tgatgcctct gtacaaccct aacgtgccaa gggtggcaac aatggcaggg 240 

atctttagcc acgcgaccag gacaccgttg aaaaatcccg cgagcaaacc aagcagtaaa 300 

gtcgcgacac aagcaacagg tagtgaatat cctgcgttca gtaacatccc caacagcacc 360 

gcgcacattc cggtaatcga acccactgaa acatcaatat tgcgcgtaag cattaccagc 420 

gtcgcgccca ttg 433 

<210> 18 

<211> 658 

<212> DNA 

<213> E. Coli 

<400> 18 

cgtgcgcttc cggttgtggc aacccgcgaa atggcgcggc ggtaagtatg gcggggttat 60 

tccttccccg ttgaggacac cgggttgtca ggttgaccat acgcttaagt gacaaccccg 120 

ctgcaacgcc ctctgttatc aattttctgg tgacgtttgg cggtatcagt tttactccgt 180 

gactgctctg ccgccctttt taaagtgaat tttgtgatgt ggtgaatgcg gctgagcgca 240 

cgcggaacag ttaaaaccaa aaacagtgtt atgggtggat tctctgtatc cggcgttaat 300 

tgttaactgg ttaacgtcac ctggaggcac caggcactgc atcacaaaat rcattgttga 360 

ggacgcgata atgaaaacgc tattaccaaa cgttaatacg tctgaaggct gttttgaaat 420 

tggtgtcact atcagtaacc cagtatttac tgaagatgcc attaacaaga gaaaacaaga 480 

acgggagcta ttaaataaaa tatgcattgt ttcaatgctg gctcgtttac gtctgatgcc 540 

aaaaggatgt gcacaatgaa ttcagcattt gtgcttgttc tgacagtttt tcttgtttcc 600 

ggagagccag ttgatattgc agtcaagtgg tcacaggaca atgcaggagt gtatgact 658 

<210> 19 
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<211> 588 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 

<222> (1) . . . (588) 

<223> n « A,T,C or G 

<400> 19 

gtgactgctc tgccgccctt tttaaagtga attttgtgat gtggtgaatg cggctgagcg 60 

cacgcggaac agttaaaacc aaaaacagtg ttatgggtgg attctctgta tccggcgtta 120 

attgttaact ggttaacgtc acctggaggc accaggcact gcatcacaaa attcattgtt 180 

gaggacgcga taatgaaaac gttattacca aacgttaata cgtctgaagg ttgttttgaa 240 

attggtgtca ctatcagtaa cccagtattt actgaagatg ccattaacaa gagaaaacaa 300 

gaacgggagc tattaaataa aatatgcatt gtttcaatgc tggctcgttt acgtctgatg 360 

ccaaaaggat gtgcacaatg aattcagcat ttgtgcttgt tctgacagtt tttcttgttt 420 

ccggagagcc agttgatatt gcagtcagtg ttcacaggac aatgcangag tgtatgactg 480 

cagcaacccg aacagaaaat tcccggtaac tgttacccgg tcgataaagt tattcaccag 540 

gataatatcg aaatcccggc aggtctttaa aacagttccg taataaat 588 

<210> 20 
<211> 101 
<212> DNA 
<213> E. Coli 

<400> 20 

gatccagcaa gaagatgcgg ttgtaccgtc atcacgcaga tgcgcaaagc tactcagcaa 60 
ctgacctttc ttcgcaataa gcacgccatt agcgtcatag a 101 

<210> 21 
<211> 465 
<212> DNA 
<213> E. Coli 

<400> 21 

tcgcgtgttt accttcaaca tcggtaactt tctggcggat agtttcacgg taagcaacct 60 

gcggtttacc tacgttcgct tcaacgttga attcacgctt catacggtca acgatgatgt 120 

cgaggtgcag ttcgcccata cccgcgatga tggtctggtt agattcttcg tcagtccata 180 

cacggaaaga cgggtcttct ttagccagac ggcccagagc cagacccatt ttttcctggt 240 

cagctttggt tttcggttca actgcgatgg agattaccgg ctcagggaat tccatacgtt 300 

ccagaatgat cggcgcatcc gggtcacaca gggtgtcacc agtggttacg tctttcagac 360 

cgatagcagc agcgatgtcg cccgcgcgaa cttctttgat ctcttcacgt ttgttagcgt 420 

gcatctgaac gatacgaccg aaacgctcac gtgcagcttt cacgg 4 65 

<210> 22 
<211> 859 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1)...(859) 
<223> n - A,T,C or G ' 

<400> 22 

tgatcggctc aagcagaact ggtttcgctt tcttaaagcc ttctttaaag gcgatagaag 60 

cagccagttt aaacgccagt tcagaggagt caacgtcatg gtaagaaccg aagtgcagac 120 

gaatacccat gtctactacc gggtagcctg ccagcggacc tgctttcagc tgttcctgga 180 

tacctttatc aacggccggg atgtattcgc cagggattac accaccttta atgtcgttga 240 

tgaactcgta gcctttcggg tttgaacccg gctccagcgg gtacatgtcg ataacaacat 300 
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gaccatactg 
tctggcggat 
attcacgctt 
tggtctggtt 
gggccanagc 
gattaccggc 
anggngtacc 
gccnaacttc 
aaaagngtta 
cggggtnaac 



accacgacca 
agtttcacgg 
catacggtca 
agattcttcg 
cagacccatt 
tcanggaatt 
aggggggtac 
tttggaacnn 
anngccantt 
ccngnaaaa 



ccagactgtt 
taagcaacct 
acgatgatgt 
tcagtccata 
ttttcctggt 
tccatacctt 
ntntttttaa 
tttaccggtt 
ttccnggngt 



tcgcgtgttt 
gcggtttacc 
cgaggtgcag 
cacggaaaga 
cagctttggt 
ccaggaatga 
nancgattgc 
ggtaaccngc 
tnanntncgg 



accttcaaca 
tacgttcgct 
ttcgcccata 
cgggtcttct 
tttcggtcaa 
tcggcgcatt 
cagcancgga 
cttttnaacn 
nttcccngaa 



tcggtaactt 
tcaacgttga 
cccgcgatga 
ttagccagac 
ctgcgatgga 
ccggtcaaac 
tntnncccgn 
atccaaccga 
ntaacccncc 



<210> 23 
<211> 269 
<212> DNA 
<213> E. Coli 

<400> 23 

ctttcttaaa gccttcttta aaggcgatag aagcagccag tttaaacgcc agttcagagg 

agtcaacgtc atggtaagaa ccgaagtgca gacgaatacc catgtctact accgggtagc 

ctgccagcgg acctgctttc agctgttcct ggataccttt atcaacggcc gggatgtatt 

cgccagggat tacaccacct ttaatgtcgt tgatgaactc gtagcctttc gggtttgaac 
ccggctccag cgggtacatg tcgataaca 

<210> 24 

<211> 330 

<212> DNA 

<213> E. Coli 



360 
420 
480 
540 
600 
660 
720 
780 
840 
859 



60 
120 
180 
240 
269 



<400 
gttttgggga 
atgactgatt 
ctgatccttc 
cggacgcacc 
gtgaatgatt 
aatataatta 



> 24 
gatgtaaggg 
gccgatacct 
tgttcttata 
tttaataact 
atgctaatgt 
atactctact 



ctaatctgaa 
gattaaacgg 
acacaaggaa 
ataaataagt 
catcaattaa 
tccagagtag 



tggctgcatt 
gtcatcaaaa 
acgtacttaa 
gtctgggcag 
ataaatataa 



ccttgtttaa 
tcatcattgc 
ggtgcgtccg 
atactatata 
tggcgttaag 



ggaaaaacga 
tgttttacag 
gtgaaccagt 
aattaactta 
gcttcccagt 



<210> 25 
<211> 471 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (471) 
<223> n - A,T,C or G 



60 
120 
180 
240 
300 
330 



<400 
gttttgggga 
atgactgatt 
ctgatccttc 
tcggacgcac 
agtgaatgat 
taatataatt 
agcacaaatt 
tantattggg 



> 25 
gatgtaaggg 
gccgatacct 
tgttcttata 
ctttaataac 
tatgctaatg 
aatactctac 
tatcccacaa 
ggattnggcc 



ctaatctgaa 
gattaaacgg 
acacaaggaa 
tataaataag 
tcatcaatta 
ttccagagta 
ctgttcttct 
cncctttttg 



tggctgcatt 
gtcatcaaaa 
acgtacttaa 
tgtctgggca 
aataaatata 
gaatattaaa 
gtctcgacat 
ncaggttggg 



ccttgtttaa 
tcatcattgc 
ggtgccgtcc 
gatactatat 
atggcgttaa 
ttttatccgc 
gccccccgat 
gtcntctnat 



<210> 26 
<211> 379 
<212> DNA 
<213> E. Coli 



ggaaaaacga 
tgttttacag 
ggtgaaccag 
aaattaactt 
ggcttcccag 
gtggtgcatc 
ctttnacaaa 

g 



60 
120 
180 
240 
300 
360 
420 
471 
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<220> 

<221> misc_f eature 

<222> (1) . . . (379) 

<223> n = A,T,C or G 



<400> 26 

natctgantg gctgcattcc 

ttaaacgggt catcaaaatc 

acaaggaaac gtacttaagg 

aaataagtgt ctgggcagat 

tcaattaaat aaatataatg 

cagagtagaa tattaaattt 

ttcttctgtc tcgacatgc 

<210> 27 

<211> 799 

<212> DNA 

<213> E. Coli 



ttgtttaagg 
atcattgctg 
tgcgtccggt 
actatataaa 
gcgttaaggc 
tatccgcgtg 



aaacccgaat 
ttttacagct 
gaaccagtcg 
ttaacttagt 
ttcccagtaa 
gtgcatcagc 



gactgattgc 
gatccttctg 
gacgcacctt 
gaatgattat 
tataattaat 
acaaatttat 



cgatacctga 
ttcttataac 
taataactat 
gctaatgtca 
actctacttc 
cccacaactg 



60 
120 
180 
240 
300 
360 
379 



<400> 27 

aaagatgatg tgatgagaaa gtcaatttga ataagacaat attaagagct aaaaaaatgt 60 

caaaaaacac taaatcaaaa aataatggca ttagaaaata taatgcgaaa acggaggtga 120 

aattagttta tttcaaatga ggaaaatctc ccggcgaaaa aaccgggaga tgaaagtgtg 180 

atgggtatca aataaacaac agaggagaaa tttttaacgc agccattcag gcaaatcgtt 240 

taatcccatt gcctggcgga taagttgcgg cttaacgcca ggaagcgtgt cggccagttt 300 

caaaccaata tcacgcagca gttttttcgc cggattggta ccggaaaaca gatcgcggaa 360 

tccctgcata ccagccagca tcaacgccgc actgtgcttg cggctacgct catagcgacg 420 

cagataaatg tactgcccga tgtctgggat ccgtcgacct gcagccaagc ttgggctttt 480 

cagcctgata cagattaaat cagaacgcag aagcggtctg ataaaacaga atttgcctgg 540 

cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgcccgta 600 

gcgcccgatg gtagtgtggg gtctccccat gcgagagtag ggaactgcca ggcatcaaat 660 

aaaacgaaag gctcagtcga aagactgggc ctttcggttt atctggtggt tgtcggtgaa 720 

cgctctctga gtaggacaaa tccgccggga gcggattttg aacgttgcga aacaaccggc 780 

ccggaaaggg gtgggggct 799 



<210> 28 
<211> 636 
<212> DNA 
<213> E. Ccli 



<220> 

<221> misc_feature 
<222> (1) . . . (636) 
<223> n = A, T, C or G 



<400> 28 

agggggtttg ttgtgggcaa tgatgcattt aagttatcgt ctgcagatag aggagatatt 60 

acaataaaca acgaancagg gcatttgata gtcaataccg caattctatc aggagatata 120 

gtcactctaa gaggaggaga aattaggttg gtattatagc ttgtgcgcgc catgattggc 180 

gcgcaattta aacttagtgc tttacatcgc tattgtcttg atttctttga attattttat 240 

aaattaaaaa aacgactgtt atgtataagc aaaggtcgaa cgaaaaatac attccaaata 300 

aatgcctgct taaatcrcta tatccttccc cgaaaaatga cacataaaat tgagatattc 360 

caaaaagaga tactacaaat aaagatgcct ttattttatt atttctaata aaaatagaag 420 

caataaaaaa taataacaat gatataaatc taatgttttt aaatatattg tcttttatgt 480 

tagtaatagt cgttagtatg tttgattctc catatattac gtgtagttt.t ttatatacat 540 

ggaaataatt ntctttatac tgagacatca caccatcatc aaatggaagt ttgaagatgg 600 

tgcttggttt gctaaccaat aaaaagagtg cattcg 636 



<210> 29 
<211> 757 
<212> DNA 



-8- 



WO 00/44906 



PCTYUS00/02200 



<213> E . Coli 



<220> 

<221> misc__f eature 
<222> (1) . . . (757) 
<223> n « A, T,C or G 



<400> 29 

cagcggtcgt atttttagca tggtttttta ttggcggcta tgctgccccg ggagcataaa 60 

gatgaaaaaa acaacgatta ttatgatggg tgtggcgatt attgtcgtac tcggcactga 120 

gctgggatgg tggtaacgtc acctctaaaa aatagcaaag gctgcctgtg tgcagccttt 180 

gtgcaattta agcgttaact tttaatcttc ctgtagataa atagcacgac aatcgcacca 240 

ataacggcaa ccacgaagct gccaaaattg aagccatcga ctttaccaaa gccaaacagc 300 

gtgctgatcc atccgccgac tacggcaccg actatcccca gcaggatagt cataaagaat 360 

ccacctccat ctttacctgg catgatccac ttcgccagaa taccggcaat aagcccaaaa 420 

ataatccatg acagaatgcc cattgtttcc tcacttatct gttttgcatt agcgggttag 480 

tcgctgataa aaagcatagc acaacatcgg gagggcaaga tttgtgacga gcatcacgga 540 

ggtttttttt gcgatggcgc agaaattgcg ccatcaacga tcagtgataa ttaccaacca 600 

caaacatcat gttcgttttc cgtgtcataa gaaccgtacg ggattcacca gatcttttat 660 

cacttcaagc cggcacttct ggcaccagca aagtcatcgg cgtctctggt tcataatcga 720 

ccggaaacgc cattgctggt attggtgacn gtcacgg 757 



<210> 30 
<211> 392 
<212> DNA 
<213> E. Coli 



<400> 30 

aattacagaa aaaggaggca atatcgggta aaggcattag cccgacgaat acgtcgggct 60 

acaaatatta ttgtgctgca ggtgttttag cgggttgttg atccacaggt tctaactgga 120 

agaccacatc gacctgatca tcaaactgaa tagcggcctg ctcgtaagtt tcctgggcgg 180 

acaccggcgc ggcatcggct ttcatcatcc gcaccattgg gctgggctga tagttggaaa 24 0 

catggtagcg cacgctatat accggcccca gtttacgatg aaagccgttc gccagttcct 300 

gcgcctgatg aatcgcgtta tcaatcgctg ccttacgcgc tttgtcttta taggcatccg 360 

gctgcgccac gcccagcgac acagaacgaa tt 392 



<210> 31 
<211> 351 
<212> DNA 
<213> E . Coli 



<400> 31 

ctatccttga tgaaaccgcg agcaaagata ggtgattacg tcatggtttt acagaaaatt 60 

acagaaaaag gaggcaatat cgggtaaagg cattagcccg acgaatacgt cgggctacaa 120 

atattattgt gctgcaggtg ttttagcggg ttgttgatcc acaggttcta actggaagac 180 

cacatcgacc tgatcatcaa actgaatagc ggcctgctcg taagtttcct gggcggacac 240 

cggcgcggca tcggctttca tcatccgcac cattgggctg ggctgatagt tggaaacatg 300 

gtagcgcacg ctatataccg gccccagttt acgatgaaag ccgttcgcca g 351 

<210> 32 
<211> 762 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (762) 
<223> n = A,T,C or G 



<400> 32 

aattatgaaa cactgtctgg aatcgtctga atgacgggca catttgcgag cacgcatcca 



60 
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gtaataacac aggaaactat tttatctacg cgtcagcgat agactgcttg catggcgaaa 120 

ggaggtaagc cgacgatttc agcgggacgc tgaaacggga aagcccctcc cgaggaaggg 180 

gccataaata aggaaagggt catgatgaag ctactcatca tcgtggtgct cttagtcata 240 

agcttccccg cttactaaga ctaccagggc gggggaaacc ccgctctacc ctcactcctg 300 

aaagtatgcc ttcacgataa gattgtcaat ccgcaggctt tgtagtctgc gatcctgcca 360 

gcaaatattc tttgcgagtc gttacgcaat aatcacagag gaaactattt tattcacgcg 420 

ttagcgatag actgcattca gggcgaaagg aggtaagccg atgatttcag cgggacgctg 4 80 

aaacgggaaa gcctctcccg gagaagaggg cttttaataa ggaaagggtt atgatgaagc 540 

acgtcatcat actggtgata ctcttagtga ttagcttcca ggcttactaa gaacaccagg 600 

gggaggggga aacctcttcc taaccctcac ttctgaaatt gggtgctatg acgctggcgt 660 

tactgcttan cgctaccagt ttgtctgccc tggcggttgt aacgccagat cggtacccgt 720 

ttggatattt taatgaaagc cgacaaatca atcancgtga eg 762 

<210> 33 
<211> 293 
<212> DNA 
<213> E. Coli 



<400> 33 

geacatttge gagcaegcat ccagtaataa cacaggaaac tattttatct aegegttage 60 

gatagactgc ttgcatggcg aaaggaggta agecgacgat ttcagcggga cgctgaaacg 120 

ggaaagcccc tcccgaggaa ggggccataa ataaggaaag ggtcatgatg aagctactca 180 

tcatcgtggt gctcttagtc ataagcttcc ccgcttacta agactaccag ggcgggggaa 240 

accccgctct accctcactc ctgaaagtat gccttcacga taagattgtc aat 293 

<210> 34 
<211> 633 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (633) 
<223> n = A, T, C or G 



<400> 34 

atttacactt tttacgaaat catgggatca ctaacaaaat atcgcttgtc agttatattg 60 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 120 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 180 

cagccaccgc gggttcgacc aatacctegg gaatttctaa gtatgagtta agtagtttca 240 

ttgetgaett taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 300 

agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggaegcact 360 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac egaeggtaaa atcattaaag 420 

cctacgacgg tgagattttt tatcatcget aaaaaaagee ccctcatcat gagggggaaa 480 

tgcagacacc ttgntatttt ttattattag ccacttgctc gtcttgcttg gtattaagtc 540 

gtatttcacg ttgattaatg cnggtggctc cagtgcgcca gattaacttt gtttggatcg 600 

aagacgtagt aactggctgg ttatcggaat tgg " 633 

<210> 35 
<211> 569 
<212> DNA 
<213> E. Coli 



<400> 35 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 60 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 120 

cagccaccgc gggttcgacc aatacctegg gaatttctaa gtatgagtta agtagtttca 180 

ttgetgaett taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 240 

agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggaegcact 300 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac egaeggtaaa atcattaaag 360 

cctacgacgg tgagattttt tatcatcget aaaaaaagee ccctcatcat gagggggaaa 420 
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tgcagacacc ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 480 
tatttcacgt tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcg 540 
tagacgtagt aactggctgg tatcggaat 559 

<210> 36 
<211> 338 
<212> DNA 
<213> E. Coli 

<400> 36 

cgtattcaca tccttttgat tggtgataac atgcgaatcg gtattatttt tccggttgta 60 

atcttcatta cagcggtcgt atttttagca tggtttttta ttggcggcta tgctgccccg 120 

ggagcataaa gatgaaaaaa acaacgatta ttatgatggg tgtggcgatt attgtcgtac 180 

tcggcactgc ctgggatggt ggtaacgtca cctctaaaaa atagcaaagg ctgcctgtgt 240 

gcagcctttg tgcaatttaa gcgttaactt ttaatcttcc tgtagataaa tagcacgaca 300 

atcgcaccaa taacggcaac cacgaagctg ccaaaatt 338 

<210> 37 
<211> 375 
<212> DNA 
<213> E . Coli 



<400 
ctgaatattt 
ataaaatatt 
caaagaagca 
ttttaattta 
atacataaat 
tccatctcta 
taaaacaact 



> 37 
aaaaaggaaa 
tccgtggagc 
ttgaatgcag 
acatgtaaac 
ggagtcatgt 
attgcatatt 
gtaca 



acgacatgaa 
attttattat 
ggaaaaataa 
gcatggttaa 
tttccctttt 
ttaatttttc 



accgaagcac 
tgaatataga 
tatggccata 
tcctcatatc 
ccatttatca 
tgataaatgg 



agaatcaaca 
ggtttaactc 
aaaaacatcg 
acgggtggag 
agttcctgtt 
cattgagcat 



ttctccaatc 
cggtaaaaaa 
aaagaaactc 
tgttaagaac 
gccgttttag 
cgatttcatt 



60 
120 
180 
240 
300 
360 
375 



<210> 38 
<211> 446 
<212> DNA 
<213> E. Coli 

<400> 38 

ttacgatagc tattagtaaa aatataagag ttagctgtat tgttatgtct gtggcgaaat 60 

tgactacctt cgtttttttg attaagaatg attttattat cgtaagtaaa attacatgaa 120 

tatttaaaaa ggaaaacgac atgaaaccga agcacagaat caacattctc caatcataaa 180 

atatttccgt ggagcatttt attattgaat atagaggttt aactccggta aaaaacaaag 240 

aagcattgaa tgcagggaaa aataatatgg ccataaaaaa catcgaaaga aactctttta 300 

atttaacatg taaacgcatg gttaatcctc atatcacggg tggagtgtta agaacataca 360 

taaatggagt catgttttcc cttttccatt tatcaagttc ctgttgccgt tttagtccat 420 

ctctaattgc atattttaat ttttct 446 

<210> 39 
<211> 392 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (392) 
<223> n - A,T,C or G 



<400> 39 

tcaccccggt gccgattttc aggcatcctg atttaactta gcacccgcaa cttaactaca 60 

ggaaaacaaa gagataaatg rctaatcctg atgcaaatcg agccgatttt ttaatcttta 120 

cggactttta cccgcctggt ttattaattg cactgtnatc cgggcgttcg cccgctttaa 180 

tcacaatagg ctgtgtagcc tgggcctgtt tctctttcac ccgcgccaga gcggcagcaa 240 
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tcgcatcttt atctttggct gcaggttgaa cggctgcgct cttatgtcgt tcaaggcgag 300 
ccgcrrrttc gcgctccaga cgagcctggc gcgcttcgaa acgcgctttg gcttctgcgg 360 
cncgcrtttc ttcctgacga atagccgcaa tt 392 

<210> 40 

<211> 208 

<212> DNA 

<213> E. Coli 

<400> 40 

taataacgct atctgcggat aaagcagaat aggtggttaa ccccagacat aaaccgagga 60 

aaataatgtt attgtatttc ataatctatt gttccttagc gacagattgc tgtctgctgg 120 

ttcagtaagg taccaggaga aacttcagga agcttgtact cgacaataca gtttgagttt 180 

ttatctttgc cccatgaaac ctgtaatt 208 

<210> 41 
<211> 342 
<212> DNA 
<213> E. Coli 

<400> 41 

catcctcaat accgttaaat gcaacccgaa cccccgttgt ccctttgctg cattcactta 60 

acgtaatctg aaaagggacg gctggacttg tgctaccggt cgttggaaat tgtctggcac 120 

tgtttttttg gagatctacg gtaaaattaa gcgaatccga tgagactgtg cagccataat 180 

cgaggacgcg cccgctaatt ttaataacgc tatctgcgga taaagcagaa taggtggtta 240 

accccagaca taaaccgagg aaaataatgt tattgtattt cataatctat tgttccttag 300 

cgacagattg ctgtctgctg gttcagtaag gtaccaggag aa 342 

<210> 42 
<211> 841 
<212> DNA 
<213> £ . Coli 

<220> 

<221> misc_feature 
<222> (1) - . . (841) 
<223> n = A,T,C 

<400> 42 
agatntactg ccaatttccg 
acgaaccacg gctataccgc 
gggcgctgat tcttcccacg 
aggatcgttg agcagcagat 
ctgaataccc gnnagaaaag 
tggcaatngc aatcattttt 
ttntatgccg tcataacttt 
gctncaacac gtaaatttna 
ccccncnnan ttttaagntn 
anttttttng naataattcc 
cggtttnntn nttnncnngg 
tagncrattt naagnggnct 
ttttatatan aattntctta 
gantcccnnt ccctntttaa 
c 

<210> 43 
<211> 215 
<212> DNA 
<213> E. Coli 

<400> 43 



or G 



gcagatcgga aagggttaaa ccatattgat ccataagggt 60 

caggcatggc ttgagccatg gcattaaatt ccgcaaattc 120 

cggttatttt ggcacacacc agatccagca aggggttntc 180 

gatctaccag ttncagcgcc tgggtgtatt gntccttgtt 240 

gtgccacagc anttagcttn tctcctgctt gcaagatgtc 300 

tgcacttant acgatgnaca ncngtaaaga aatcgnattt 360 

acgtatgtan cactttttgc nattcnaaaa aagaccattn 420 

ttgnccccna catttanaac ataaatgntt aaaattttcc 480 

ttnanagaat ngggaattac ctgcttttna atgnactcan 540 

tntatcnaan ctnnttttcn cccaanagnc nnccaaattn 600 

cnttttttta cccnanaann tttattcaan nccttttttg 660 

ttnttnnatt aactttccnn ttggncaaat tttggcnnat 720 

tntcntaatt tnggnanccc cngatgnaan tttatggngg 780 

tnnatgntct gggntatttt taaancctnn attaannnan 840 

841 
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aataactttt cgttaggcag ttttgggtgt gagttgcaag aggggagact actgaataac 60 

tcaagtttta taatcgaggg gaaaatggtg atggcgttca tagcaaaacg ccctcaacca 120 

taaaggtcga gggcgcttaa gatgttaaaa acccgctatc cgttaaaaaa caatgttcaa 180 

ctaaggtcag tgacattgcg ctaaaaaagc gaatt 215 



<210> 44 
<211> 395 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (395) 
<223> n = A, T, C or G 



<400> 44 

gcattattca tgagaaatgt gtatcgtaaa tcaactgaaa ttaacgcaac catttgttat 60 

ttaaggttta attatctgtg tgtgatattt tattgaatgt tttaaatatt gtttttattg 120 

gcattgctat aatattggtt atcatttgct gaatggattc agtcttaatg agtgggtttt 180 

taagggacag gcatagagta atgatacgta tgcataacca acatctttac tcattatgtc 240 

attgaatgtt gaccctatgt gtttatgaag gagaggtatt ttcagttgat ctggattgnt 300 

aaattcatat aatgcgcctt tgctcatgaa tggatgccag tatgtagtgg gaaattataa 360 

atattgaaat agtccaacta cttctttatt accaa 395 



<210> 45 
<211> 883 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (883) 
<223> n = A,T,C or G 



<400> 45 

ataatcaggt aagaaaaggt gcgcggagat taccgtgtgt tgcgatatat tttttagttt 60 

cgcgtggcaa tacatcagtg gcaataaaac gacatatcca gaaaaatata cactaagtga 120 

atgatatctt ccgatttatc ttaatcgttt atggataacg gcaaagggct tcgttttttc 180 

ctatacttat tcagcactca caaataaagg aacgccaatg aaaattatac tctgggctgt 240 

attgattatt ttcctgattg ggctactggt ggtgactggc gtatttaaga tgatatttta 300 

aaattaatta atgtcatcag gtccgaaaat aacgagaata tttcagtctc tcatcctgtt 360 

gcgctcctgt catgtgcatt gcttcatata atcactggcg caaggagcgc cgcaggcgna 420 

gnntgcncgn cgncccacct naccccatgc cgaacttcag aantgaaaac nccntaacnc 4 80 

cgatngtcgg cgggngcctc cccatgcnan agtangggaa ntgccangcg ncnnattaaa 540 

cgaaaggctn attncaaaga ctgggccttn cntttatctg atgtttgtcg gagaacgctc 600 

tcctgagnan gacaaatncc gccgggagcg gatttgaacn ttgcgaagca accgncccna 660 

agggngnngt cntgacnccc nnctctanct nncngccttc ttttgcttna angncctcct 720 

ancngatggc ctttttngcc ntctaccaaa cnntttggtt aatgcttnta aaancctttc 780 

cannntncaa tccngtnntn cccatccnnn tnntgaaagn ntncctnccn tgtncantnt 840 

anntnngggg gnngngngcc ggcggncccc cccccccccc ccc 883 



<210> 46 
<211> 1024 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (1024) 
<223> n = A,T,C or G 
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<400> 46 

gtttatggat aacggcaaag ggcttcgttt tctcctatac ttattcagca ctcacaaata 

aaggaacgcc aatgaaaatt atactctggg ctgtattgat tattttcctg attgggctac 

tggtggtgac tggcgtattt aagatgatat tttaaaatta attaatgtca tcaggtccga 

aaataacgag aatatttcag tctctcatcc tgttgcgctc ctgtcatgtg cattgcttca 

tataatcact ggcgcaagga gcgcgcagag tnctccnant nnnnntnntt ntntnnctnn 

nccttcacna tncnnccncn nantnnatag nncaccnntn ttnntcnnnn gnccncctcc 

nnncnnnnnn ncatnnnatc ccactnnntt tnctccannn nnncnnnntn canccnacaa 

antncnaccn anntnacctt atacnnannc nancnnnnnn nnccactctn nctcgnnctc 

cccnttcnac nnccannnnn cancnntcnn ctnnnnccct nncntaattn ttctnnctan 

ntcctanccn cnnacnnncc cancnatccn nnnatacant cnattnntnn cnntcncntn 

cnccnnttcc nnctnnncnc tnccncatnc ccnnnannan canntncccc ncctncctna 

ccncncncnc ccnccatccc nnnccnncnt ccnnantnga caannnnaat cncnnnnncn 

nnnnnncnnn tnnncncccn gcncnnccnt nccntcacnc tnnncnncta nannnnntac 

nntnaccnnt cctnncacnc tnccctnnng antccnacna ntnnnnnanc nanaacnctn 

tnnnnccata atcccacacc acncccntnc ancntntnnt ncntcntccc ttcntatcnc 

agctnnnnnt nctntnnnnc tnccncccnn cnnactncnn nnacccncnn cccantcagt 

ccaccntccn cnncnnnntn nnncnancan ctnncacncn cnantaacct nntnncacct 
tccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1024 



<210> 47 
<211> 236 
<212> DNA 
<213> E. Coli 

<400> 47 

atatacacta agtgaatgat atcttccgat ttatcttaat cgtttatgga taacggcaaa 

gggcttcgtt ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat 

tatactctgg gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt 

taagatgata ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatat 

<210> 48 
<211> 418 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (418) 
<223> n=A,T,Cor G 



60 
120 
180 
236 



<400> 
cggagattac 
ataaaacgac 
ntcgtttatg 
ataaaggaac 
tactggtggt 
cgaaaataac 
tcatataatc 



48 

cgtgtgttgc 
atatccagaa 
gataacggca 
gccaatgaaa 
gactggcgta 
gagaatattt 
actggcgcaa 



gatatatttt 
aaatatacac 
aagggcttcg 
attatactct 
tttaagatga 
cagtctctca 
ggagcgcgca 



ttagtttcgc 
taagtgaatg 
ttttttccta 
gggctgtatt 
cattttaaaa 
tcctgttgcg 
nggggcggcc 



gtggcaatac 
atatcttccg 
tacttattca 
gattattttc 
ttaattaatg 
ctcctgtcat 
aatcgccgcc 



atcagtggca 
attnatctta 
gcactcacaa 
ctgattgggc 
tcatcaggtc 
gtgcattgct 
ggcccctg 



60 
120 
180 
240 
300 
360 
418 



<210> 49 
<211> 550 
<212> DNA 
<213> E. Coli 



<400> 49 



ctgctagtta 
caaacagggg 
caagaaaagg 
ggcaaacacg 
gagttagtta 



cagggaacac 
atgcccagcg 
agaaactgat 
aagaagccct 
tgatggtgac 



taatgacaga 
ttttcgtgca 
gaccattgct 
gcgtattgct 
ccgactttca 



cagctaaaag 
tttattggtg 
gacagattac 
caggagatgc 
ccagacgatc 



ccctgtttaa 
agatagcgga 
gtgaagaagg 
tggatagagg 
ttatcgcgca 



ttacgtatta 
acgcgcacca 
cgcaatgcag 
tttagacaga 
aagccactaa 



60 
120 
180 
240 
300 
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tcctgtaaca ccgggagtta actggcggat gtttgctgta aaccacatca gcgaacgaca 360 

tccgccagcg cctcttctaa atcgtaccag cgaaacgcaa aacccgcttc ttccagccgt 420 

ttaggcagcg cgcgttgtcc acctaatacc agtactgaag attcgcccat taacagtcga 480 

atggcggtcg cggggacgcg caaaatggcc gggcgatgca gcgcatgacc gagcgcatgg 540 

gcaaattgtt 550 

<210> 50 
<211> 99 
<212> DNA 
<213> E. Coli 

<400> 50 

ttggcatctc ggtgttgccg atcttcatga tatccagccc gccggaaact tcttcccaaa 60 
cggttttgct gttatccatt gagtcacgga actgcccct 99 

<210> 51 

<211> 259 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (259) 
<223> n - A,T,C or G 



<400> 51 

ccgtgccgag atgatcctgt naccatcatc cgttgtgaag tagtgattca cgacttcaag 60 

gcgcttttca aaagggtatt ttggctttga catattaggg gctattccat ttcatcgncc 120 

aacaaaatgg gtgcagtaca tactcnttgg aaatcaacac aggaggctgg gaatgccgca 180 

gaaatataga ttactttctt taatagtgat ntgtttcacg cttttatttt tnaaanaagt 240 

tnggcttact tcccgggnn 259 



<210> 52 
<211> 877 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (877) 
<223> n a A,T,C or G 



<400> 52 

cagcagagcg cggccttctt cgtcagattt cgcagtagtg gtaatggtaa tatccaaacc 60 

acgaacgcgg tcgactttat cgtagtcgat ttctgggaag atgatctgct cacggacacc 120 

catgctgtag ttaccacgac cgtcgaaaga cttagcggac aggccacgga agtcacggat 180 

acgaggtaca gcaatagtga tcaggcgctc aaagaactcc cacatgcgtt cgccacgcag 240 

agttacttta cagccgatcg gatagccctg acggattttg aagcctgcaa cagatttgcg 300 

tgctttggtg atcagcggtt tttgaccgga gattgctgcc aggtctgctg ctgcgttatc 360 

cagcagtttt ttgtcagcga tcgcttcacc aacacccatg ttcagggtga tcttctcgac 420 

ccgagggact tgcatgacag aattgtagtt aaactcagtc atgagttttt taactacttc 480 

gtctttgtag taatcatgca gtttcgccat cgtactactc catgtcggtg aacgctctcc 540 

tgagtaggac aaatccgccg ggagcggatt tgaacgttgc gaagcaacgg cccggagggt 600 

ggcgggcagg acgcccgcca taaactgcca ggcatcaaat taagcagaag gccatcctga 660 

cggatggcct ttttgcgttt ctacaaactc ttttggttat ttttctaaat cattcaaata 720 

tgtatccgnt catcccatcc tatcgatgat aagctgtcaa acatgagaat ttaatcaatc 780 

taaagtttta tggngttaaa cttgggctgg cagnttncca atggcttaat cagtngaggg 840 

ccctatntta acgaactngg ctantttngg tcaatcn 877 



<210> 53 
<211> 291 
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<212> DNA 
<213> E. Coli 

<400> 53 

tgaacagcag agatacggcc agtgcggcca atgttttttg tcctttaaac ataacagagt 60 

cctttaagga tatagaatag gggtacagct acgccagaat atcgtatttg attattgcta 120 

gtttttagtt ttgcttaaaa atattgttag ttttattaaa tgcaaaacta aattattggt 180 

atca-gaatt tgttgtatga tgaataaaat ataggggggt atagatagac gtcattttca 240 

taggcttata aatgcgacta ccatgaagtt tttaattgaa agtattgggt t 291 

<210> 54 
<211> 282 
<212> DNA 
<213> E. Coli 

<400> 54 

ttat^aaatg caaaactaaa ttattggtat catgaatttg ttgtatgatg aataaaatat 60 

aggggggtat agatagacgt cattttcata gggttataaa tgcgactacc atgaagtttt 120 

taattgaaag tattgggttg ctgataattt gagctgttct attcttttta aatatctata 180 

taggcctgtt aatggatttt atttttacaa ttttttgtgt ttaggcatat aaaaatcaac 240 

ccgccatatg aacggcgggt taaaatattt acaacttagc aa 282 

<210> 55 
<211> 293 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 

<222> (1) . . . (293) 

<223> n = A,T,C or G 

<400> 55 

cggggtccgg cgctcatcaa caatcggggg gcagcaaggg gctgaaacgg gaaagcccct 60 

cccgaagaag gggccttgta taaggaaagg gttatgatga agctcgtcat catactggtt 120 

gtgtngttac tgttaagttt cccgacttac taacaactca tcagaggggg gagaaatcct 180 

ccctcaccct tgttccttta ctctaggttg aaaaaacaac agcgtcaata ggcctgccat 240 

gtacgaagcg agatctgtga accgctttcc ggttagcctt ttttatcctg ttg 293 

<210> 56 

<211> 300 

<212> DNA 

<213> E. Coli 

<400> 56 

tctgcgttcc gctaaaaggt gcaaatgctc aggacgttgc agcgttttgc gtgaccgctc 60 

ggggaaggca aaattgcctc tgggaaagca ttgcgcgggg tccggcgctc atcaacaatc 120 

ggggggcagc aaggggctga aacgggaaag cccctcccga agaaggggcc ttgtataagg 180 

aaagggttat gatgaagctc gtcatcatac tggttgtgtt gttactgtta agtttcccga 240 

cttactaaca actcatcaga ggggggagaa atcctccctt acccttgttc ctttactcta 300 

<210> 57 
<211> 359 
<212> DNA 
<213> E. Coli 



<400> 57 

caacacagga ggctgggaat gccgcagaaa tatagattac cttctttaat agtgatttgt 60 

ttcacgcttt tatttttcac ctggatgata agagattcac tgtgtgaatt gcatattaaa 120 

caggagagtt atgagctggc ggcgttttta gcctgcaaat tgaaagagta agagtcttcg 180 

gcggcaaatt attcccgcct tacttacggc gttgcgcatt ctcattgcac ccaaatttat 240 
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tcttcacaaa aataataata gattttatta cgcgatcgat tatttatttc ctgaaaacaa 300 
ataaaaaaat ccccgccaaa tggcagggat cttagattct gtgcttttaa gcagagatt 359 

<210> 58 
<211> 700 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (700) 
<223> n = A,T,C or G 



<400> 58 

aaaccttttt ctcctgtttt tcatagaggg caacccatgt cctgacctgg gttcggggga 60 

caccaaaacg tgccgagatg atcctgtaac catcatcagt tgtgaagtag tgattcacga 120 

cttcaaggcg cttttcaaaa gggtattttg gctttgacat attaggggct attccatttc 180 

atcgtccaac aaaatgggtg cagtacatac tcgttggaaa tcaacacagg aggctgggaa 240 

tgccgcagaa atatagatta ctttctttaa tagtgatttg tttcacgctt ttatttttca 300 

cctggatgat aagagattca ctgtgtgaat tgcatattaa acaggagagt tatgagctgg 360 

cggcgttttt agcctgcaaa ttgaaagagt aagagtcttc ggcgggaaat tattcccgcc 420 

ttacttacgg cgttgcgcat tctcattgca cccaaattta ttcttcacaa aaataataat 480 

agattttatt acgcgatcga ttatttattt cctgaaaaca aataanaaaa tccccgccaa 540 

atggcaggga tcttagattc tgtgctttta agcagagatt acaggctggt tacgttacca 600 

gctgccgggc ctttaacgcc gctttcgatg gtgaaggaca ctttctgacc ttcgtccaga 660 

gattgtaacc atcggtctgg atagccnaga aatgtccaac 700 



<210> 59 
<211> 631 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (631) 
<223> n = A,T,C or G 



<400> 59 

tggtggcatt ggttgctgga gagagaaaac ccccgcacgt tgcaggtatg cacctgacaa 60 

caccacgggg gcxaatcttg actctagacc actcaagaat agccgcgaaa cgttgtcatt 120 

acaacacagg cggctatatg acgttcgcag agctgggcat ggccttctgg catgatttag 180 

cggctccggt cattgctggc attcttgcca gtatgatcgt gaactggctg aacaagcgga 240 

agtaacgtgt catgcgggcg tcaggctgcc gtaatggcaa tttgcgcccg gaccaggccg 300 

caggggggaa actctgcggc ctttttcgtt cttactgcgg gtaaggcacc cagtcgccgc 360 

cgttcaggcg aacgtacggt ttatcctggt attgaataac tactgcattt gagttctcgg 420 

agaccggtgc tgtttgtggc aacccactgg tgagtttttt ccagtcaaca ttgtcttcgg 480 

tgaaaatctt gccatcgaga acgcgaacca ccagatcgga gatagccagg aagctgctcg 540 

gttgttcgat gacaatcggt gccccctgat gcggtgcctt catgccgaag aatttcaccc 600 

caacggggac gtcngtgata gaccgggcta g 631 



<210> 60 
<211> 648 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (648) 
<223> n - A,T,C or G 

<400> 60 
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ggctcaggcn tgctgattgt 
atggagagaa tcataaacgt 
ttcaaaaaat ttgagggata 
gctgttgctg cttgagggtc 
gtctggccca gcggaccagc 
tgccaaccag cgcccaggag 
ctgttgtctc cagccattca 
tttagnctga tncanattan 
gctctngtct ggtagttagc 
natnngtaan ttatntttnt 
atnntnnnan nnnncagnnc 

<210> 61 

<211> 737 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_feature 

<222> (1) . . . (737) 

<223> n - A, T, C or G 

<4G0> 61 

tgctaatatc tttctcattg agatgaaaat taaggtaagc gaggaaacac accacaccat 60 

aaacggaggc aaataatgct gggtaatatg aatgttttta tggccgtact gggaataatt 120 

ttattttctg gttttctggc cgcgtatttc agccacaaat gggatgacta atgaacggag 180 

ataatccctc acctaaccgg ccccttgtta cagttgtgta caaggggcct gatttttatg 240 

acggcgaaaa aaaaccgcca gtaaaccggc ggtgaatgct tgcatggata gatttgtgtt 300 

ttgcttttac gctaacaggc attttcctgc actgataacg aatcgttgac acagtagcat 360 

cagttttctc aatgaatgtt aaacggagct taaactcggt taatcacatt ttgttcgtca 420 

ataaacatgc agcgatttct tccggtttgc ttaccctcat acattgcccg gtccgctctt 480 

ccaatgacca catccagagg ctcttcagga aatgcgcgac tcacacctgc tgtcacggta 540 

atgttgatat gcccttcaga atgtgtgatg gcatggttat cgactaactg gcaaattctg 600 

acacctgcac gacatgcttc ttcatcatta gccgctttga caataatgat aaattcttcg 660 

cccccgtagc gataaaccgt ttcgtaatna cgcgtccaac tgggntaagt aaagttgcca 720 

gggtgccgta atcttac 737 

<210> 62 
<211> 648 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 

<222> (1) . . . (648) 

<223> n - A,T,C or G 

<400> 62 

.tgcttttgaa tatgtgctcg caatcttgag aaggaaatgg cgaccacgaa agaaaaggca 60 

aaaaccgata atctgaaaga acccaagtat ttcagtataa gcattgaatg ccgaccagta 120 

aactctttcg gattcaccca gaaagtgaan ccaaaatgat aatcgtatac ataagtcttt 180 

cgagtggctc gttagcaaaa agtttcaaca atggagtaaa tacatccaac atatcaataa 240 

ctctcaactg taaggggatt gaaatggtaa ccccagctct tcgcttgagg ggtatagccg 300 

agaccaccga agccccggag gtggtgaaat aaaaccgggc acaacacgaa agggcgcatt 360 

tccgatatcc ataaaagaag tcgggtcttt gtcrggtaaa attaaattgg tgggaagtgc 420 

gcctccgggt tgtaaatacc gactttgctg ggtgtagcct ggcggcatca agtttttttc 480 

tggaagttcg ctgatgtccg ccctttttaa agggaatttt ggtgatgccg gtgaatgccg 540 

cttaaccccc cgtgggccca gttaaaagtc atggtaagnc ctaatnggtt tggggtggga 600 

aaagccnact gnnaattggt tacctggttt gcaagtancc ctggaagg 648 

<210> 63 



ttttttgtgc aatggcccng tattagcgtc gttgctgtcg 60 

ggtgaatgat gattgttagc aaggaaaact gtcaaaaatc 120 

aggccggaat ggctccggcc agagggaagt taaccgcgaa 180 

gttttaacca gacgccaggc gctccatacg ccaaaaccgc 240 

acattaggat ggcgaatcgt ccagatcgcc atcacgctac 300 

cgcagactta gcagcatatt ccancgacga tcgtaagcgc 360 

cgacgactgg cggaagggnc cgcgnctgac caacttgnct 420 

attnataaac gcagnanncn ggtntgatta atcntatttn 480 

nncggnnngt ctcnttntna cccnnttcnn tttannttac 540 

nngtctnant tntanttgng tactntaagt ntatncgnnn 600 

ntntttttta aatnntttnt nanncnnc 648 
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<211> 237 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (237) 
<223> n = A,T,C or G 

<400> 63 

ggtgtttant tacaagagat tcatctttgt ntaaancccn gataagtaat tacgcataaa 

acaacaatga ttataatagc aaaaataaat attatcatct ttgatagatt acttgagata 

gccagcatct tgtaaagcct ttatcgtttt tttatgctct ggattaatat aatcactaca 

tctatctgag caatctgttg ttgatggaca tgtcaaccca tggtcattta cagccaa 

<210> 64 
<211> 427 
<212> DNA 
<213> E . Coli 



60 
120 
180 

237 



<400> 
gataattaga 
aatcatgcca 
cttcaaaatg 
aggaatgccg 
tagccttgcc 
tggcgcacaa 
ggcggtggta 
gctgatg 



64 

gtttgtcgtc 
ttagtcattg 
aacggcttca 
ctggataaag 
ctgatcatgg 
cgtatcgcca 
ctgaccggtt 



agaaaattga 
ttgctatcgg 
tcgctctcgt 
ttattggctc 
gttttggcgc 
ccacgctgat 
ttaccgttgg 



cgttacccat 
tgtaatcttg 
cctcgtggcg 
catcaaagcc 
aatgctgggc 
tgccaaattt 
ttttgccctg 



aacaaatgaa 
ttgttgctcc 
cttgctgttg 
ggtgtcggcg 
aaaatgctgg 
ggtaaaaaac 
ttctatgaag 



aggccaggta 
tgatgatccg 
gattaatgca 
ggacgctcgg 
cagactgcgg 
acatccagtg 
tgggctttgt 



60 
120 
180 
240 
300 
360 
420 
427 



<210> 65 
<211> 261 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (261) 
<223> n = A,T,C or G 

<400> 65 

caaagaacct tcaacatgaa aaatatccat ttgtttgcaa aaaaagatta ttaggaagga 
aattaatgca attatcgaaa attcaaaaaa tatccaaaaa tngtatactt tattccagaa 
gagttcaata taatgtttgt cttcaatttt tcttacttca gggtaatata gattgctcat 
tacattgtga gcttcatctt tatttaattt tctgttgact ccagctctcc gtgataacgg 
ttttataatt agatgcttat c 



60 
120 
180 
240 
261 



<210> 66 
<211> 98 
<212> DNA 
<213> E . Coli 



<400> 66 

agatgattgc cgggaacttg ttagcggcac gcaggcggcg gctcgcaccc ttaccctgct 
ctttacgtac ttctgcgttg atagtaaaca tttctttc 

<210> 67 
<211> 260 
<212> DNA 
<213> E. Coli 



60 
98 
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<400> 67 

aagcgcgaac gaagtcgatg tgctgcagct tcggtttgta cgggtgacgc tgtacgtcct 60 

gagctttaac tttgatttct ttaccgtcaa caacgatggt cagaacttcg ctgtagaatt 120 

cagctttagc ttgcatgttc atgactttgt cgtgatccag ctcgatagcc agcggcgctt 180 

ctttgccacc gtagatgatt gccgggaact tgttagcggc acgcaggcgg cggctcgcac 240 

ccttaccctg ctctttacgt 260 

<210> 68 
<211> 95 
<212> DNA 
<213> E. Coli 

<400> 68 

aaaaacggcg taaagaaagg ttgcaaacat gttaataaaa actcaaattg atcccacgta 60 
tatatcacgc cgcaaaatcc ttacaataaa caggg 95 



<210> 69 
<211> 174 
<212> DNA 
<213> E . Coli 

<400> 69 

ttaatratta aaatagtgta acgcgattat gtggttatgg gggtaaacat taaataaacc 

agcggggagg ggaggtaaag tgaaaaaata aaaagcggat aatcttaata agcaggccgg 

acagcatcgc catccggcac tgatacgagg tttatttcag ctcatcaacc atcg 

<210> 70 

<211> 138 

<212> DNA 

<213> E. Coli 



60 
120 
174 



<400> 70 

agtctgtaaa aacgtcaaaa agagtgtttt atcaacagaa gaatggaggt ctgacagata 60 

gtagtaatgc aaaaaaatgg agacttaagt tgaatgaacg ggagtaaagc gaaaagacta 120 

tagagcgaag gagaaatt 138 



<210> 71 
<211> 191 
<212> DNA 
<213> E . Coli 

<400> 71 

tttgttggct taatattcta ttgttatctt tatttataga tgtttatatt gcatgaggtg 
gtttttggag agaagaatga ggaagatgcg tcgagccaca gaaacgttag ctttacatat 
agcggaggtg atgtgaattt aatttacaat agaaataatt tacatatcaa acagttagat 
gctttttgtc g 

<210> 72 
<211> 244 
<212> DNA 
<213> E. Coli 



60 
120 
180 
191 



<400> 72 

ggccatttat acaggaaaag cctatgtcag aacgtaaaaa ctcaaaatca cgccgtaatt 60 

atctcgttaa atgttcctgc ccaaactgca cccaagagtc agaacacagt ttttcaagag 120 

tacaaaaagg tgcccttttg atctgccctc attgcaacaa agtattccag acaaatctta 180 

aagctgtagc ctgattgatt ttattagtaa caagtatttt ttatatttta ataatatatt 240 

taaa 244 



<210> 73 
<211> 327 
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<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 

<222> (1) . . . (327) 

<223> n = A, T,C or G 

<400> 73 

aaattttcag gtaccttgtc accatacttt tttttctgag cattaatgat attttgagct 60 

tcttgaggat ctttaactcc ccacatttgg tggaaagtat tcatattaaa aggaaggntg 120 

aataatttgn ctttataaat cgccagtgga gaattagtaa aacgattaaa ttctactaaa 180 

tnattaaccg naaaaaaatt cccatatata tttatcattg gtatgaaaaa tatgtgcacc 240 

atatttatga atntggatac cctnacagtc ctctgtgtac gcatttccac cgatatgatt 300 

tcttttctna atcactaaaa ctttttt 327 

<210> 74 
<211> 150 
<212> DNA 
<213> E. Coli 

<400> 74 

gcagtgatcg aagcgatgac gaagtgtatg gaaaaatcag aaaaactcag caaatcctga 60 

tgactttcgc cggacgtcag gccgccactt cggtgcggtt acgtccggct ttctttgctt 120 

tgtaaagcgc caaatctgcc gatttcaacc 150 

<210> 75 

<211> 330 

<212> DNA 

<213> E. Coli 

<400> 75 

gaaagtatct tcgttattga catcactgga aaatataact tgcttttcat tattaaactc 60 

gaagcgcgta ccgtatctgg acaaacattt atcgagctta ccaaattcct gaagaggttt 120 

aactacagat aacatttgcg cgtcctttgc agtaatgccc gtcaaatcct tgacgggcat 180 

tatttagatt aaattaccag tatttcttcg gagtgaagaa tattaccagg tatatttaac 240 

acccacgttc gcggaccagt cttgatctac gtcaccacca ccgaggtagt tagcatcggt 300 

ataggcgctg aagttcttgg tgaagctaaa 330 

<210> 76 
<211> 194 
<212> DNA 
<213> E. Coli 

<400> 76 

tgtttttttc cagcaacgga gcaaaaggtt tgcccttgtg cagctcaggg ttaaccactt 60 

taactacgtg gcgacgaccc ggagatgtcg gtttacattt aacaactgcc attgtattac 120 

tcctccgact tactcagcgc cgccaacgaa gtccagattc tggccttctt tcagggtgac 180 
gtaagctttt ttcc 194 

<210> 77 
<211> 188 
<212> DNA 
<213> E. Coli 

<400> 77 

tccctttaac taccagggtg ttaacgactt cgacttcgac ttcaaacagt ttctgcacag 60 

cagctttgat ttctgctttg gtcgcgtctt tagcaacttt gagtactatg gtgttggatt 120 

tttccaccgc agtagacgct ttttcagaaa cgtgcggtgc acgcagcacc ttcagcagac 180 

gttcttca 18 8 
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<210> 78 
<211> 173 
<212> DNA 
<213> E. Coli 

<400> 78 

acaaaggcga acaaagcctg tgaagcccga aggctccaca gacagtgcta cttgaaggcc 60 

ttactgtttc ttcttaggag cgagcaccat gatcatctgg cggccttcga tcttggttgg 120 

gaaggattcg accactgcca gttcttgcaa atcgtctttc acgcgattaa gca 173 

<210> 79 
<211> 272 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (272) 
<223> n « A,T,C or G 

<400> 79 

tggagaaaac gggtgattga taaagcaatc atcgttctag gggcgttaat tgcgctgctg 60 

gaactgatcc cgctttctgc ttcaagcttc tgaactggat acggaaacgt aatnagggct 120 

aaagaagaca ctactcttag ccctttaaca tttaacgcat tgtcacgaac tcttctgccg 180 

ccgttgggtg aatggcgacg ggtattggtc gaaatctttt ttgggtggcc ccatctttaa 240 

cgcccacccg cgaaaccctg caacatttcg tc 272 

<210> 80 
<211> 259 
<212> DNA 
<213> E. Coli 

<400> 80 

cgcaggcagc tgatggtcaa caggatgaga gaaacccaga. gacaggttaa tcacattgcc 60 

tttaaccgct gcacggtaac ctacaccaac cagctgcagc ttcttagtga agccttcggt 120 

aacaccgata accattgagt tcagcagggc acgcgcggta ccagcctgtg cccaaccgtc 180 

tgcgtaacca tcacgcggac cgaaggtcag ggtattatct gcatgtttaa cttcaacagc 240 

atcgttgaga gtacgagtc 259 

<210> 81 
<211> 73 
<212> DNA 
<213> E. Coli 

<400> 81 

caggtcggaa cttacccgac aaggaatttc gctaccttag gaccgttata gttacggccg 60 
ccgtttaccg ggg 73 

<210> 82 
<211> 666 
<212> DNA 
<213> E . Coli 

<400> 82 

atgaacgttt tctcgcaaac tcaacgctat aaggcgttgt tctggttatc gttatttcat 60 

ctgctggtga tcacctccag taactatctg gttcagcttc ccgtctccat tttgggtttc 120 

cataccacct ggggcgcgtt tagctttccg tttatttttc ttgctaccga cctgaccgtg 180 

cgtatttttg gcgcaccgct ggcccgacgc attatcttcg cggtaatgat ccctgcgtta 240 

ttaatctcct acgtcatctc gtcgctattc tatatgggtt cctggcaggg attcggcgca 300 

ctcgcccact tcaacctgtt tgtcgcccgt atcgccaccg ccagtttcat ggcctacgcg 360 

ctggggcaaa tcctcgacgt gcacgttttt aaccgcctgc gtcagagtcg ccgctggtgg 420 
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ctggcaccga cagcgtccac actgttcggt aacgtcagcg acacgctggc ctttttcttc 480 

attgccttct ggcgtagccc ggatgccttt atggctgaac actggatgga aatcgcgctg 540 

gtcgattact gtttcaaagt gttaatcagt atcgttttct tcctgccaat gtatggcgta 600 

ttactcaata tgctgttgaa aagactggca gataaatccg aaatcaacgc tttgcaggcg 660 

agttaa 666 

<210> 83 
<211> 612 
<212> DNA 
<213> E. Coli 



<400> 83 



gtgataagat 
aatctggtta 
aaagacacta 
gtcaacctgc 
ggtaagctgg 
gcagtcatcg 
ctggaagatc 
gagctgaaag 
gcgtactctt 
ctgcccgctg 
gttattcagt 



ggatgaatga 
aatatgtcgg 
acgctacggc 
ttgatggcaa 
gaacgcaggc 
tcatcatggg 
agcaacgtag 
gtcacaaaat 
ccgttattct 
acgatcagca 
aa 



gccgttatgg 
aattggcctg 
gcagggttcg 
gttaagtttc 
caataacatg 
cgatgatccg 
ccgcgatccg 
gcagcagtta 
gggtaacgtg 
aaaagcgcag 



ccgtttatcg 
ctggttatgg 
gtcgcggaaa 
tcgctgccag 
catgtctggt 
aaagaagatc 
cagctgcaag 
gacagtatta 
ggtaatcaac 
accaccgcag 



aaaggaagaa 
ggcttgcggc 
gtaacgctac 
cggatatgac 
ccgacgccac 
tggcggtgct 
tggtaaccaa 
tctccgcgaa 
tgctgaccat 
aaaacatcat 



gtcaatgcgc 
ctgtgatgat 
cgggaatccc 
cgaccagagc 
cgggcagaaa 
ggcgaagcgt 
taaagccatt 
aggccagacg 
gcaaattacg 
taatacgctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
612 



<210> 84 
<211> 975 
<212> DNA 
<213> E. Coli 

<400> 84 

atggcgaata tgtttgccct gattctggtg attgccacac tggtgacggg cattttatgg 60 

tgcgtggata aattcttttt cgcacctaaa cggcgggaac gtcaggcagc ggcgcaggcg 120 

gctgccgggg actcactgga taaagcaacg ttgaaaaagg ttgcgccgaa gcctggctgg 180 

ctggaaaccg gtgcttctgt ttttccggta ctggctatcg tattgattgt gcgttcgttt 240 

atttatgaac cgttccagat cccgtcaggt tcgatgatgc cgactctgtt aattggtgat 300 

tttattctgg tagagaagtt tgcttatggc attaaagatc ctatctacca gaaaacgctg 360 

atcgaaaccg gtcatccgaa acgcggcgat atcgtggtct ttaaatatcc ggaagatcca 420 

aagcttgatt acatcaagcg cgcggtgggt ttaccgggcg ataaagtcac ttacgatccg 480 

gtctcaaaag agctgacgat tcaaccggga tgcagttccg gccaggcgtg tgaaaacgcg 540 

ctgccggtca cctactcaaa cgtggaaccg agcgatttcg ttcagacctt ctcacgccgt 600 

aatggtgggg aagcgaccag cggattcttt gaagtgccga aaaacgaaac caaagaaaat 660 

ggaattcgtc tttccgagcg taaagagaca ctgggtgatg tgacgcaccg cattctgaca 720 

gtgccgattg cgcaggatca ggtggggatg tattaccagc agccagggca acaactggca 780 

acctggattg ttcctccggg acaatacttc atgatgggcg acaaccgcga caacagcgcg 840 

gacagccgtt actggggctt tgtgccggaa gcgaatctgg tcggtcgggc aacggctatc 900 

tggatgagct tcgataagca agaaggcgaa tggccgactg gtctgcgctt aagtcgcatt 960 

ggcggcatcc attaa 975 

<210> 85 
<211> 1761 
<212> DNA 
<213> E. Coli 



<400 
ttgaccatta 
atatttgctc 
cggctgcaat 
gcgaaggccc 
caaagcgatg 
ttacgtcacg 
gactgggtag 



> 85 
cgaaacttgc 
agccacattt 
ttgcgctgga 
cggaagagtc 
caggccaact 
cagtgagtgc 
aagcggagca 



atggcgtgac 
gattgacgaa 
gcagttgctg 
tgagtatctg 
ggtgggcggt 
agatgataat 
actctttggc 



ctggttcctg 
aacgatcctt 
catacgcgag 
aatcttattg 
cactatgagg 
tttgcgactt 
tgcctgcgcc 



ataccgatag 
tattcagtga 
catcctcctc 
ccaatgccgc 
tttccggcca 
taacgcaagt 
agtttaatgg 



ctatcaggaa 
tactcaaccg 
ttttatgctg 
gcgtacgcta 
ctccatccgc 
tgtcgctgcc 
cgacattacc 



60 
120 
180 
240 
300 
360 
420 
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ctgcagcctg 
ctgctggcgc 
gactgggttg 
aagctgaaag 
ccagagcttt 
gcggagtcag 
cctgcaccgg 
gaacaagaaa 
tccctgtgtg 
gaatggcgcg 
ctgattgaaa 
ccgggtcatc 
gatggtgaat 
gggatgatga 
ttctcagcat 
atggctgaac 
gctatcacag 
aaaatcgaag 
attatcccca 
gaagaaggca 
ttaaatctgg 
atcgcgcaag 
aactggttta 



gtctggtgca 
aacctctgct 
cgtttgatga 
tcattctggt 
cagagcaggc 
taacccagtg 
gagcggatgc 
cacttccgct 
atggcgacac 
aaggtttcct 
ccgaaggcga 
cacgcgcttt 
tcaccgacat 
tcatgcaagc 
cgctgacatt 
tctgcgccct 
gttcagtcga 
gcttctttgc 
cagctaacgt 
aattcaccat 
tgtgggatgg 
catcgcaaca 
ttccgaactg 



tcaggcaaat 
gtggatgcgg 
gtcgcgccct 
aggcgaacgc 
tatttatagc 
gtgtcgctgg 
ctggccgata 
tagcccgcag 
cttctccggc 
cgctgaacgt 
acgcatcggg 
tggcgaacct 
cgaacgcaaa 
gttcctgatg 
tgagcagtca 
gataagcgcc 
tcagttcggt 
tatttgccag 
tcgccattta 
ctgggcagta 
cgaaggccaa 
ggaaggacgt 



ggcggtattc 
ctgaaaaata 
ctccccgtct 
gaatcattgg 
gaatttgaag 
gtgacattta 
cttatccgcg 
tggatcctcc 
gagcagctaa 
atgcaggatg 
caaattaacg 
tctcgcatta 
gcggagcttg 
tcggaactac 
tacagtgaag 
ctcgccgatg 
cgcgcccagc 
caacgtgagt 
agtcttcaca 
gacgatgtga 
acgacgctga 
caccgttttc 



tcattatctc 
tcgttaaccg 
ctgtgccttc 
ctgatttcca 
atactctgca 
ccgccagaca 
aagcagcacg 
gccagtgtaa 
acttaatgct 
agatccttca 
ccctttcggt 
gctgcgttgt 
gcggcaatat 
agcttgagca 
ttgatggaga 
tgccggtgaa 
cggtcggtgg 
taaccgggaa 
gtgaactggt 
ctgacgcact 
tgcaaaccat 
catggccatt 



tttgcgtaca 
cgagcgtttt 
gatgccattg 
ggagatggag 
gattgtcgat 
taatcactta 
ctacaccggt 
agaggtcgcc 
gcagcagcgt 
ggagcaaatc 
cattgaattt 
gcatattggc 
ccatgcgaaa 
acagatcccc 
tagtgcctcg 
tcagagtatc 
tttaaatgag 
acaaggtgtc 
gaaagcggta 
gccgttatta 
ccaggaacgt 
acgttggctg 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1761 



<210> 86 
<211> 1185 
<212> DNA 
<213> E. Coli 

<400> 86 

gtgtctaaag aaaaatttga acgtacaaaa ccgcacgtta acgttggtac tatcggccac 60 

gttgaccacg gtaaaactac tctgaccgct gcaatcacca ccgtactggc taaaacctac 120 

ggcggtgctg ctcgtgcatt cgaccagatc gataacgcgc cggaagaaaa agctcgtggt 180 

atcaccatca acacttctca cgttgaatac gacaccccga cccgtcacta cgcacacgta 240 

gactgcccgg ggcacgccga ctatgttaaa aacatgatca ccggtgctgc tcagatggac 300 

ggcgcgatcc tggtagttgc tgcgactgac ggcccgatgc cgcagactcg tgagcacatc 360 

ctgctgggtc gtcaggtagg cgttccgtac atcatcgtgt tcctgaacaa atgcgacatg 420 

gttgatgacg aagagctgct ggaactggtt gaaatggaag ttcgtgaact tctgtctcag 480 

tacgacttcc cgggcgacga cactccgatc gttcgtggtt ctgctctgaa agcgctggaa 540 

ggcgacgcag agtgggaagc gaaaatcctg gaactggctg gcttcctgga ttcttatatt 600 

ccggaaccag agcgtgcgat tgacaagccg ttcctgctgc cgatcgaaga cgtattctcc 660 

atctccggtc gtggtaccgt tgttaccggt cgtgtagaac gcggtatcat caaagttggt 720 

gaagaagttg aaatcgttgg tatcaaagag actcagaagt ctacctgtac tggcgttgaa 780 

atgttccgca aactgctgga cgaaggccgt gctggtgaga acgtaggtgt tctgctgcgt 84 0 

ggtatcaaac gtgaagaaat cgaacgtggt caggtactgg ctaagccggg caccatcaag 900 

ccgcacacca agttcgaatc tgaagtgtac attctgtcca aagatgaagg cggccgtcat 960 

actccgttct tcaaaggcta ccgtccgcag ttctacttcc gtactactga cgtgactggt 1020 

accatcgaac tgccggaagg cgtagagatg gtaatgccgg gcgacaacat caaaatggtt 1080 

gttaccctga tccacccgat cgcgatggac gacggtctgc gtttcgcaat ccgtgaaggc 1140 

ggccgcaccg ttggcgcggg cgttgttgct aaagttctgg gctaa 1185 

<210> 87 
<211> 2115 
<212> DNA 
<213> E. Coli 



<400> 87 

atggctcgta caacacccat cgcacgctac cgtaacatcg gtatcagtgc gcacatcgac 60 

gccggtaaaa ccactactac cgaacgtatt ctgttctaca ccggtgtaaa ccataaaatc 120 

ggtgaagttc atgacggcgc tgcaaccatg gactggatgg agcaggagca ggaacgtggt 180 

attaccatca cttccgctgc gactactgca ctctggtctg gtatggctaa gcagtatgag 240 
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ccgcatcgca tcaacatcat cgacaccccg gggcacgttg acttcacaat cgaagtagaa 300 
cgttccatgc gtgttctcga tggtgcggta atggtttact gcgcagttgg tggtgttcag 360 
ccgcagtctg aaaccgtatg gcgtcaggca aacaaatata aagttccgcg cattgcgttc 420 
gttaacaaaa tggaccgcat gggtgcgaac ttcctgaaag ttgttaacca gatcaaaacc 480 
cgtctgggcg cgaacccggt tccgctgcag ctggcgattg gtgctgaaga acatttcacc 540 
ggtgttgttg acctggtgaa aatgaaagct atcaactgga acgacgctga ccagggcgta 600 
accttcgaat acgaagatat cccggcagac atggttgaac tggctaacga atggcaccag 660 
aacctgatcg aatccgcagc tgaagcttct gaagagctga tggaaaaata cctgggtggt 720 
gaagaactga ctgaagcaga aatcaaaggt gctctgcgtc agcgcgttct gaacaacgaa 780 
atcatcctgg taacctgtgg ttctgcgttc aagaacaaag gtgttcaggc gatgctggat 840 
gcggtaattg attacctgcc atccccggtt gacgtacctg cgatcaacgg tatcctggac 900 
gacggtaaag acactccggc tgaacgtcac gcaagtgatg acgagccgtt ctctgcactg 960 

gcgttcaaaa tcgctaccga cccgtttgtt ggtaacctga ccttcttccg tgtttactcc 1020 

ggtgtggtta actctggtga taccgtactg aactccgtga aagctgcacg tgagcgtttc 1080 

ggtcgtatcg ttcagatgca cgctaacaaa cgtgaagaga tcaaagaagt tcgcgcgggc 1140 

gacatcgctg ctgctatcgg tctgaaagac gtaaccactg gtgacaccct gtgtgacccg 1200 

gatgcgccga tcattctgga acgtatggaa ttccctgagc cggtaatctc catcgcagtt 1260 

gaaccgaaaa ccaaagctga ccaggaaaaa atgggtctgg ctctgggccg tctggctaaa 1320 

gaagacccgt ctttccgtgt atggactgac gaagaatcta accagaccat catcgcgggt 1380 

atgggcgaac tgcacctcga catcatcgtt gaccgtatga agcgtgaatt caacgttgaa 14 40 

gcgaacgtag gtaaaccgca ggttgcttac cgtgaaacta tccgccagaa agttaccgat 1500 

gttgaaggta aacacgcgaa acagtctggt ggtcgtggtc agtatggtca tgttgttatc 1560 

gacatgtacc cgctggagcc gggttcaaac ccgaaaggct acgagttcat caacgacatt 1620 

aaaggtggtg taatccctgg cgaatacatc ccggccgttg ataaaggtat ccaggaacag 1680 

ctgaaagcag gtccgctggc aggctacccg gtagtagaca tgggtattcg tctgcacttc 1740 

ggttcttacc atgacgttga ctcctctgaa ctggcgttta aactggctgc ttctatcgcc 1800 

tttaaagaag gctttaagaa agcgaaacca gttctgcttg agccgatcat gaaggttgaa 1860 

gtagaaactc cggaagagaa caccggtgac gttatcggtg acttgagccg tcgtcgtggt 1920 

atgctcaaag gtcaggaatc tgaagttact ggcgttaaga tccacgctga agtaccgctg 1980 

tctgaaatgt tcggatacgc aactcagctg cgttctctga ccaaaggtcg tgcatcatac 204 0 

actatggaat tcctgaagta tgatgaagcg ccgagtaacg ttgctcaggc cgtaattgaa 2100 

gcccgtggta aataa 2115 

<210> 88 
<211> 540 
<212> DNA 
<213> E. Coli 

<400> 88 

atgccacgtc gtcgcgtcat tggtcagcgt aaaattctgc cggatccgaa gttcggatca 60 
gaactgctgg ctaaatttgt aaatatcctg atggtagatg gtaaaaaatc tactgctgaa 120 
tctatcgtat acagcgcgct ggagaccctg gctcagcgct ctggtaaatc tgaactggaa 180 
gcattcgaag tagctctcga aaacgtgcgc ccgactgtag aagttaagtc tcgccgcgtt 240 
ggtggttcta cttatcaggt accagttgaa gtccgtccgg ttcgtcgtaa tgctctggca 300 
atgcgttgga tcgttgaagc tgctcgtaaa cgcggtgata aatccatggc tctgcgcctg 360 
gcgaacgaac tttctgatgc tgcagaaaac aaaggtactg cagttaagaa acgtgaagac 420 
gttcaccgta tggccgaagc caacaaggcg ttcgcacact accgttggtt atcccttcgg 480 
agttttagtc accaggcggg cgcttccagt aagcagcccg ctttgggcta cttaaattga 540 

<210> 89 
<211> 1549 
<212> DNA 
<213> £ . Coli 

<400> 89 

aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 60 
gtcgaacggt aacaggaagc agcttgctgc ttcgctgacg agtggcggac gggtgagtaa 120 
tgtctgggaa gctgcctgat ggagggggat aactactgga aacggtagct aataccgcat 180 
aatgtcgcaa gaccaaagag ggggaccttc gggcctcttg ccatcggatg tgcccagatg 240 
ggattagctt gttggtgggg taacggctca ccaaggcgac gatccctagc tggtctgaga 300 
ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 360 
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ggaatattgc acaatgggcg caagcctgat gcagccatgc cgcgtgtatg aagaaggcct 420 

tcgggttgta aagtactttc agcggggagg aagggagtaa agttaatacc tttgctcatt 480 

gacgttaccc gcagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 54 0 

ggtgcaagcg ttaatcggaa ttactgggcg taaagcgcac gcaggcgggt tggttaagtc 600 

agatgtgaaa tccccgggct caacctggga actgcatctg atactggcaa gcttgagtct 660 

cgtagagggg ggtagaattc caggtgtagc ggtgaaatgc gtagagatct ggaggaatac 720 

cggtggcgaa ggcggccccc tggacgaaga ctgacgctca ggtgcgaaag cgtggggagc 780 

aaacaggatt agataccctg gtagtccacg ccgtaaacga tgtcgacttg gaggttgtgc 840 

ccttgaggcg tggcttccgg agctaacgcg ttaagtcgac cgcctgggga gtacggccgc 900 

aaggttaaaa ctcaaatgaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa 960 

ttcgatgcaa cgcgaagaac cttacctggt cttgacatcc acggaagttt tcagagatga 1020 

gaatgtgcct tcgggaaccg tgagacaggt gctgcatggc tgtcgtcagc tcgtgttgtg 1080 

aaatgttggg ttaagtcccg caacgagcgc aacccttatc ctttgttgcc agcggtccgg 1140 

ccgggaactc aaaggagact gccagtgata aactggagga aggtggggat gacgtcaagt 1200 

catcatggcc cttacgacca gggctacaca cgtgctacaa tggcgcatac aaagagaagc 1260 

gacctcgcga gagcaagcgg acctcataaa gtgcgtcgta gtccggattg gagtctgcaa 1320 

ctcgactcca tgaagtcgga atcgctagta atcgtggatc agaatgccac ggtgaatacg 1380 

ttcccgggcc ttgtacacac cgcccgtcac accatgggag tgggttgcaa aagaagtagg 1440 

tagcttaacc ttcgggaggg cgcttaccac tttgtgattc atgactgggg tgaagtcgta 1500 

acaaggtaac cgtaggggaa cctgcggttg gatcacctcc ttaccttaa 1549 

<210> 90 
<211> 375 
<212> DNA 
<213> E . Coli 

<400> 90 

atggcaacag ttaaccagct ggtacgcaaa ccacgtgctc gcaaagttgc gaaaagcaac 60 

gtgcctgcgc tggaagcatg cccgcaaaaa cgtggcgtat gtactcgtgt atatactacc 120 

actcctaaaa aaccgaactc cgcgctgcgt aaagtatgcc gtgttcgtct gactaacggt 180 

ttcgaagtga cttcctacat cggtggtgaa ggtcacaacc tgcaggagca ctccgtgatc 240 

ctgatccgtg gcggtcgtgt taaagacctc ccgggtgttc gttaccacac cgtacgtggt 300 

gcgcttgact gctccggcgt taaagaccgt aagcaggctc gttccaagta tggcgtgaag 360 

cgtcctaagg cttaa 375. 

<210> 91 
<211> 366 
<212> DNA 
<213> E. Coli 

<400> 91 

atgtctatca ctaaagatca aatcattgaa gcagttgcag ctatgtctgt aatggacgtt 60 

gtagaactga tctctgcaat ggaagaaaaa ttcggtgttt ccgctgctgc tgctgtagct 120 

gtagctgctg gcccggttga agctgctgaa gaaaaaactg aattcgacgt aattctgaaa 180 

gctgctggcg ctaacaaagt tgctgttatc aaagcagtac gtggcgcaac tggcctgggt 240 

ctgaaagaag ctaaagacct ggtagaatct gcaccggctg ctctgaaaga aggcgtgagc 300 

aaagacgacg cagaagcact gaaaaaagct ctggaagaag ctggcgctga agttgaagtt 360 

aaataa - 3gg 

<210> 92 
<211> 498 
<212> DNA 
<213> £ . Coli 

<400> 92 

atggctttaa atcttcaaga caaacaagcg attgttgctg aagtcagcga agtagccaaa 60 

ggcgcgctgt ctgcagtagt tgcggattcc cgtggcgtaa ctgtagataa aatgactgaa 120 

ctgcgtaaag caggtcgcga agctggcgta tacatgcgtg ttgttcgtaa caccctgctg 180 

cgccgtgctg ttgaaggtac tccgttcgag tgcctgaaag acgcgtttgt tggtccgacc 240 

ctgattgcat actctatgga acacccgggc gctgctgctc gtctgttcaa agagttcgcg 300 

aaagcgaatg caaaatttga ggtcaaagcc gctgcctttg aaggtgagct gatcccggcg 360 
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tctcagatcg accgcctggc aactctgccg acctacgaag aagcaattgc acgcctgatg 420 

gcaaccatga aagaagcttc ggctggcaaa ctggttcgta ctctggctgc tgtacgcgat 480 

gcgaaagaag ctgcttaa 4 gg 

<210> 93 
<211> 2145 
<212> DNA 
<213> E. Coli 

<400> 93 

gtgtcccgta ttattatgct gatccctacc ggaaccagcg tcggtctgac cagcgtcagc 60 

cttggcgtga tccgtgcaat ggaacgcaaa ggcgttcgtc tgagcgtttt caaacctatc 120 

gctcagccgc gtaccggtgg cgatgcgccc gatcagacta cgactatcgt gcgtgcgaac 180 

tcttccacca cgacggccgc tgaaccgctg aaaatgagct acgttgaagg tctgctttcc 240 

agcaatcaga aagatgtgct gatggaagag atcgtcgcaa actaccacgc taacaccaaa 300 

gacgctgaag tcgttctggt tgaaggtctg gtcccgacac gtaagcacca gtttgcccag 360 

tctctgaact acgaaatcgc taaaacgctg aatgcggaaa tcgtcttcgt tatgtctcag 420 

ggcactgaca ccccggaaca gctgaaagag cgtatcgaac tgacccgcaa cagcttcggc 480 

ggtgccaaaa acaccaacat caccggcgtt atcgttaaca aactgaacgc accggttgat 540 

gaacagggtc gtactcgccc ggatctgtcc gagattttcg acgactcttc caaagctaaa 600 

gtaaacaatg ttgatccggc gaagctgcaa gaatccagcc cgctgccggt tctcggcgct 660 

gtgccgtgga gctttgacct gatcgcgact cgtgcgatcg atatggctcg ccacctgaat 720 

gcgaccatca tcaacgaagg cgacatcaat actcgccgcg ttaaatccgt cactttctgc 780 

gcacgcagca ttccgcacat gctggagcac ttccgtgccg gttctctgct ggtgacttcc 840 

gcagaccgtc ctgacgtgct ggtggccgct tgcctggcag ccatgaacgg cgtagaaatc 900 

ggtgccctgc tgctgactgg cggttacgaa atggacgcgc gcatttctaa actgtgcgaa 960 

cgtgctttcg ctaccggcct gccggtattt atggtgaaca ccaacacctg gcagacctct 1020 

ctgagcctgc agagcttcaa cctggaagtt ccggttgacg atcacgaacg tatcgagaaa 1080 

gttcaggaat acgttgctaa ctacatcaac gctgactgga tcgaatctct gactgccact 1140 

tctgagcgca gccgtcgtct gtctccgcct gcgttccgtt atcagctgac tgaacttgcg 1200 

cgcaaagcgg gcaaacgtat cgtactgccg gaaggtgacg aaccgcgtac cgttaaagca 1260 

gccgctatct gtgctgaacg tggtatcgca acttgcgtac tgctgggtaa tccggcagag 1320 

atcaaccgtg ttgcagcgtc tcagggtgta gaactgggtg cagggattga aatcgttgat 1380 

ccagaagtgg ttcgcgaaag ctatgttggt cgtctggtcg aactgcgtaa gaacaaaggc 1440 

atgaccgaaa ccgttgcccg cgaacagctg gaagacaacg tggtgctcgg tacgctgatg 1500 

ctggaacagg atgaagttga tggtctggtt tccggtgctg ttcacactac cgcaaacacc 1560 

atccgtccgc cgctgcagct gatcaaaact gcaccgggca gctccctggt atcttccgtg 1620 

ttcttcatgc tgctgccgga acaggtttac gtttacggtg actgtgcgat caacccggat 1680 

ccgaccgctg aacagctggc agaaatcgcg attcagtccg ctgattccgc tgcggccttc 1740 

ggtatcgaac cgcgcgttgc tatgctctcc tactccaccg gtacttctgg tgcaggtagc 1800 

gacgtagaaa aagttcgcga agcaactcgt ctggcgcagg aaaaacgtcc tgacctgatg 1860 

atcgacggtc cgctgcagta cgacgctgcg gtaatggctg acgttgcgaa atccaaagcg 1920 

ccgaactctc cggttgcagg tcgcgctacc gtgttcatct tcccggatct gaacaccggt 1980 

aacaccacct acaaagcggt acagcgttct gccgacctga tctccatcgg gccgatgctg 2040 

cagggtatgc gcaagccggt taacgacctg tcccgtggcg cactggttga cgatatcgtc 2100 

tacaccatcg cgctgactgc gattcagtct gcacagcagc agtaa 2145 

<210> 94 
<211> 1767 
<212> DNA 
<213> E. Coli 

<400> 94 

atgaataatt ctattaacca taaatttcat cacattagcc gggctgaata ccaggaattg 60 

ttagccgttt cccgtggcga cgctgttgcc gattatatta ttgataatgt ctctattctc 120 

gacctgatca atggcggaga aatttccggc ccaattgtga ttaaaggacg ttacattgcc 180 

ggtgttggcg cagaatacac tgatgctccg gctttgcagc ggattgatgc tcgcggcgca 240 

acggcggtgc cagggtttat tgatgctcac ctgcatattg aatccagcat gatgacgccg 300 

gtcacttttg aaaccgctac cctgccgcgc ggcctgacga ccgttatttg cgaccctcat 360 

gaaatcgtca acgtgatggg cgaagccgga ttcgcctggt ttgcccgctg tgccgaacag 420 

gcaaggcaaa accagtactt acaggtcagc tcttgcgtac ccgccctgga aggctgcgat 480 
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gttaacggtg ccagttttac ccttgaacag atgctcgcct ggcgggacca tccgcaggtt 540 

accggccttg cagaaatgat ggactaccct ggcgtaatta gcgggcagaa tgcgctgctc 600 

gataaactgg atgcatttcg ccacctgacg ctggacggtc actgcccggg tttgggtggt 660 

aaagaactta acgcctatat tactgcgggt attgaaaact gccacgaaag ttatcagctg 720 

gaagaaggac gccggaaatt acaactcggc atgtcgttga tgatccgcga agggtccgct 780 

gcccgcaatc tcaacgcgct ggcaccgttg atcaacgaat ttaacagccc gcaatgcatg 840 

ctctgtaccg atgaccgtaa cccgtgggag atcgcccatg aaggacacat cgatgcctta 900 

attcgccgcc tgatcgaaca acacaatgtg ccgctgcatg tggcatatcg cgtcgccagc 960 

tggtcgacgg cgcgccactt tggtctgaat cacctcggct tactggcacc cggcaagcag 1020 

gccgatatcg tcctgttgag cgatgcgcgt aaggtcacgg tgcagcaggt actggtgaaa 1080 

ggcgagccga ttgatgcgca aaccttacag gcggaagagt cggcgagact ggcacaatcc 1140 

gctccgccat atggcaacac cattgcccgc cagccagttt ccgccagcga ctttgccctg 1200 

caatttacgc ccggaaaacg ctatcgggtc attgacgtca tccataacga attgattacg 1260 

cactcccact ccagcgtcta cagcgaaaat ggttttgatc gcgatgatgt gagctttatt 1320 

gccgtacttg agcgttacgg gcaacggctg gctccggctt gtggtttgct tggcggcttt 1380 

ggactgaatg aaggtgcgct ggctgcgacg gtcagccatg acagccataa tattgtggtg 1440 

atcggtcgca gtgccgaaga gatggcgctg gcggtcaatc aggtgattca ggatggcggc 1500 

gggctgtgcg tggtacgtaa cggccaggta caaagtcatc tgccgttacc cattgccggg 1560 

ctgatgagca ccgacacggc gcagtcgctg gcggaacaaa ttgacgcctt gaaagccgcc 1620 

gcccgtgaat gcggtccgtt acccgatgag ccgtttattc agatggcgtt tctttctctg 1680 

ccagtgatcc ccgcgctaaa actaaccagt caggggctat ttgatggcga gaagtttgcc 1740 

ttcactacgc tggaagtcac ggaataa 1767 

<210> 95 
<211> 1227 
<212> DNA 
<213> E. Coli 

<400> 95 

atggcgtatt gcaatccggg cctggaatcc aggccgaata agagaaacgc cctccggcgt 60 

catgtggtaa caggcatagg tatgaaaatc gtaatcgccc cagactctta taaagaaagt 120 

ttatctgcca gcgaggttgc gcaggcgata gaaaaaggat ttcgggaaat ttttcctgat 180 

gcacagtacg tttctgttcc ggttgccgac ggtggcgaag gaacggtgga agcgatgatt 240 

gcagccaccc agggggctga acgtcacgcc tgggttacag ggccgctggg cgagaaagtg 300 

aatgccagtt gggggatctc cggcgatggc aaaaccgcgt ttattgaaat ggcggcggcc 360 

agtgggctgg agctggtacc tgcggaaaaa cgcgatccac tcgtgaccac ttcacgcggc 420 

acaggcgagt taatcctgca ggcgctggag agcggtgcga caaacattat tatcggcatt 480 

ggcggcagcg ctacaaatga tggcggcgca ggcatggtac aggcgctggg ggcgaaatta 540 

tgcgacgcca acggcaatga aattggtttt ggcggcggta gtcttaatac tctgaatgat 600 

attgatattt ccggcctcga tccgcgctta aaagattgcg tcattcgcgt cgcttgtgat 660 

gtcaccaatc cgctggtggg cgataacggc gcatcgcgca tctttggccc acaaaaggga 720 

gccagrgaag cgatgattgt tgagctggac aataacctct ctcactatgc cgaggtcatt 780 

aaaaaagcgc tgcatgttga tgtgaaagat gtccccggtg caggagctgc gggtggtatg 84 0 

ggcgcggcgc taatggcgtt tcttggtgcg gaactgaaaa gtggtattga aatcgtcact 900 

acggcgctga atctggagga acatattcac gattgtacgc tggtgatcac cggtgaaggg 960 

cgtattgaca gccagagtat tcacgggaag gtaccgattg gtgtcgcaaa cgtggcgaag 1020 

aagtaccata aaccggtgat tggcattgcg ggtagcctga ccgatgatgt tggcgttgta 1080 

catcagcatg gcattgatgc ggtcttcagc gtattgacca gcataggtac gttggacgaa 1140 

gcattccgcg gggcttatga caatatctgc cgtgcttcac gtaatatcgc cgcgacactg 1200 

gcgatrggaa tgcgcaacgc ggggtga 1227 

<210> 96 
<211> 900 
<212> DNA 
<213> E. Coli 

<400> 96 

atgartgata tgactatgaa agttggtttt attggcctgg ggattatggg taaaccaatg 60 

agtaaaaacc ttctgaaagc aggttactcg ctggtggt.tg ctgaccgtaa cccagaagct 120 

artgctgacg tgattgctgc aggtgcagaa acagcgtcta cggctaaagc gatcgctgaa 180 

cagtgcgacg tcatcataac catgctgcca aactcccctc atgtgaaaga ggtggcgctg 240 
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ggtgagaatg gcattattga aggcgcgaag ccaggtacgg tattgatcga tatgagttct 300 

atcgcaccgc tggcaagccg tgaaatcagc gaagcgctga aagcgaaagg cattgatatg 360 

ctggatgctc cggtgagcgg cggtgaaccg aaagccatcg acggtacgct gtcagtgatg 420 

gtgggcggcg acaaggctat tttcgacaaa tactatgatt tgatgaaagc gatggcgggt 480 

tccgtggtgc ataccgggga aatcggtgca ggtaacgtca ccaaactggc aaatcaggtc 540 

attgtggcgc tgaatattgc cgcgatgtca gaagcgttaa cgctggcaac taaagcgggc 600 

gttaacccgg acctggttta tcaggcaatt cgcggtggac tggcgggcag taccgtgctg 660 

gatgccaaag cgccgatggt gatggaccgc aacttcaagc cgggcttccg tattgatctg 720 

catattaagg atctggcgaa tgcgctggat acttctcacg gcgtcggcgc acaactgccg 780 

ctcacagctg cggttatgga gatgatgcag gcactgcgag cagatggttt aggaacggcg 840 

gatcatagcg ccctggcgtg ctactacgaa aaactggcga aagtcgaagt tactcgttaa 900 

<210> 97 
<211> 771 
<212> DNA 
<213> E. Coli 

<400> 97 

atgaataacg atgttttccc gaataaattc aaagccgcac tggctgcgaa acaggtacaa 60 

attggttgct ggtcagcact ctctaacccg attagcactg aagttcttgg tttggctggg 120 

tttgactggc tggtgctgga tggcgaacat gcgccaaacg atatctccac gtttattccg 180 

cagttaatgg ccttgaaagg cagcgccagc gcgccagtag tgcgagtgcc gaccaacgag 240 

ccggtaatta ttaagcgtct tctggatatc ggtttctata acttcctgat tccttttgta 300 

gaaacaaaag aggaagcaga gctggcggtg gcatcaaccc gttacccacc ggaaggcatt 360 

cgcggcgtct ccgtttctca ccgcgccaat atgtttggca ccgtggcgga ttatttcgct 420 

cagtcgaaca agaacatcac tattctggtc cagatagaaa gtcagcaggg cgtagataac 480 

gtcgatgcca ttgccgctac cgaaggcgta gacggcatct tcgtcggccc cagcgatctg 540 

gccgcggcat taggccatct cggcaatgca tcacacccgg atgtacaaaa agcaattcag 600 

cacattttta accgtgccag cgcgcacggc aaacccagcg gtatcctcgc gccggtcgaa 660 

gccgatgcgc gtcgttatct ggaatggggc gcgacgtttg tggctgtcgg cagcgatctc 720 

ggcgtcttcc gctctgccac tcagaaactg gctgatacct ttaaaaaata a 771 

<210> 98 

<211> 1335 

<212> DNA 

<213> E . Coli 

<400> 98 

atgattctgg acaccgttga cgaaaaaaag aaaggcgtgc atacccgcta tttaatatta 
ctgattattt ttattgttac cgccgttaac tacgccgatc gtgcaacgct gtctattgct 
ggtaccgaag tggcaaaaga gttgcagtta agtgcggttt cgatgggtta catcttctcc 
gcttttggct gggcctactt gctgatgcaa atccccggcg gctggctgct tgataagttt 
ggctcgaaaa aagtttacac ctacagcctc tttttctggt cgctattcac cttcctgcaa 
ggctttgttg atatgttccc gctggcctgg gcagggatct ccatgttctt tatgcgcttt 
atgctcggct tctcggaagc gccatcattc ccggcgaacg cccgaattgt cgccgcctgg 
ttcccgacga aagaacgtgg tactgcctcc gccatcttta actcggcgca atatttctcg 
ctggcgctct tttcgccgct gcttggctgg ctgactttcg cctggggctg ggagcacgtc 
tttaccgtta tgggggtgat tggttttgtg ctgacggcgc tgtggatcaa gttgattcat 
aacccgacag atcacccacg tatgtctgcg gaagagctga agtttatctc tgaaaatggc 
gcggtggtcg atatggacca caaaaagccg ggcagtgcgg cagcaagcgg acccaaactg 
cattacatca agcaattgct ctctaaccgc atgatgctgg gcgtattttt cggacaatat 
tttatcaaca ccatcacctg gttcttcctc acctggttcc cgatttatct ggtgcaggaa 
aaaggcatgt cgattctgaa agtgggtctg gtcgcctcga ttccagcact gtgtggtttt 
gcgggcggcg tgctgggagg tgtcttctcg gattatctga tcaaacgcgg tttatccctg 
accctggcac gtaagctacc gattgtgctg ggaatgttgc tggcttccac catcatctta 
tgtaactaca ccaacaacac cacgctggtg gtcatgctga tggcgctggc tttctttggc 
aaaggatttg gtgcgctggg ctggccggtg atttctgaca ccgcgccgaa agagattgtt 
ggcctctgcg gcggcgtctt taacgtcttt ggcaatgttg cctccattgt cactccactg 
gtgattggct acctggtaag tgaactgcac tccttcaatg cagcactggt tttcgtggga 
tgttcagcgc tgatggcgat ggtctgctac ctcttcgtag ttggcgacat taaacgtatg 
gaattgcaga aataa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1335 
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<210> 99 
<211> 1536 
<212> DNA 
<213> E. Coli 



<400> 99 

atgcaaacga gtgatacccg cgcgttaccg ctactttgcg cccgctcggt ttataaacag 60 

tattcagggg tcaatgtcct gaaaggcatc gattttacgt tgcatcaggg ggaggtccac 120 

gccctgctcg gcggcaatgg tgccggtaaa tcgacgttaa tgaagattat tgccggtatt 180 

acccctgctg atagcggtac gctggagatt gagggcaaca actacgtcag attaacgcca 240 

gttcatgctc atcagctggg tatttatctc gttccccagg aaccgctgct tttcccaagc 300 

ctgtcgataa aagaaaacat cctgtttggg ctggcaaaaa aacagctctc catgcagaaa 360 

atgaagaact tgctggcggc gctgggctgc cagtttgatc tgcatagtct ggcaggatcg 420 

ctggatgtcg ccgatcgcca aatggtggaa atcctccgcg ggctgatgcg cgactcgcgg 4 80 

attctgatcc tcgatgaacc taccgcctcg cttacccctg cggaaaccga acgcttgttt 540 

agtcgcttgc aagagctgct tgctactggc gtgggtattg tttttatctc gcataagctg 600 

ccggaaattc gccagattgc cgatcgaatt agcgtgatgc gcgacggaac catcgcctta 660 

agcggcaaaa ccagcgaact gtctaccgac gacattattc aggccatcac cccagcggta 720 

cgggaaaaat cgctctctgc cagccaaaaa ttatggctgg agttacctgg taaccgccca 780 

caacatgccg ccggaacgcc ggtgctgaca ctggaaaatc tgaccggcga aggtttcagg 840 

aatgtcagcc tgacgctcaa tgccggagaa attctgggcc tggctgggct ggtgggggcc 900 

ggacgcacag aactggccga gacgctctat ggtctgcgta ctttgcgtgg cggacgcatt 960 

atgctgaatg gtaaagagat caataaatta tccactggag aacgtttact gcgcggtctg 1020 

gtttatctgc cggaagatcg ccagtcatcc ggactgaatc tcgatgcttc gctggcctgg 1080 

aacgtctgcg cccttactca taaccttcgt ggattctggg cgaaaaccgc gaaagataat 1140 

gccaccctgg aacgttatcg tcgggcgctg aatattaaat tcaaccaacc ggaacaagct 1200 

gcacggacat tatccggtgg caaccagcaa aaaatcctca ttgccaaatg cttggaagct 1260 

tcgccgcaag tattgattgt cgatgagccg acgcgcggcg tggatgtctc ggcccgtaat 1320 

gatatctacc agctgttgcg cagcatcgcc gcacaaaatg tggctgtgct gcttatctcc 1380 

tccgacctgg aagagatcga actgatggca gatcgtgtgt atgtgatgca tcagggcgaa 1440 

attacccact ctgcactgac cgagcgcgat attaatgtcg agactattat gcgcgttgcc 1500 

ttcggcgata gtcagcgtca ggaggcgtca tgctga 1536 

<210> 100 
<211> 1029 
<212> DNA 
<213> E. Coli 



<400> 100 

atgctgaagt ttattcagaa caaccgtgaa atcacggcac tgctggcggt ggtgctgctg 60 

tttgtattac ccggttttct cgaccgccag tatttaagtg tgcaaacgct gaccatggtt 120 

tatagcagcg cgcaaatcct gatcctgctg gcaatgggcg cgacgctggt aatgcttacg 180 

cgcaatattg atgtttcagt gggttcgatt accggaatgt gcgcggtgct gttggggatg 240 

ttactgaacg caggatattc actacctgtt gcttgtgtcg cgactttact gcttggtttg 300 

ctcgcgggat ttttcaacgg tgtcctggtc gcgtggctaa agatccctgc cattgttgcc 360 

acccttggca cgttagggtt gtacagaggc atcatgttgc tgtggactgg cggcaaatgg 420 

attgaagggt tacccgccga actgaaacag ctctccgccc cgctgctgct tggcgtttca 480 

gcaattggtt ggttgacgat aattctggtg gcatttatgg cctggctgct ggcaaagacg 540 

gcgtttggac gcagttttta tgccacgggc gataatttac agggcgctcg tcaactgggc 600 

gttcgtactg aagccattcg cattgtggca ttttcgttga acggctgcat ggcggcactg 660 

gcgggaattg tgtttgcttc gcagattggt tttatcccca accagaccgg taccgggctg 720 

gagatgaaag caattgcagc ctgcgtgctg ggcggcatta gtttgctcgg tggttccggt 780 

gcgatcattg gtgcggtact cggcgcatgg ttcctgacgc agatcgatag cgtactggtg 84 0 

ctgttgcgca ttccggcatg gtggaatgat tttatcgcgg 'gtctggttct gctggcggtg 900 

ctggtgtttg atggacgcct gcgttgtgcg ctggaacgta atctacggcg gcaaaaatat 960 

gcccgcttta tgacgccacc gccatccgtt aaacccgctt cgtcaggtaa aaaacgggag 1020 

gccgcataa 1029 

<210> 101 
<211> 993 
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<212> DNA 
<213> E. Coli 



<400> 101 

atgcgtattc gctacggttg ggaactggct cttgccgcac tgctcgttat tgagattgtc 60 

gcatttggtg caattaaccc gcgaatgtta gatctcaata tgttgctgtt cagcaccagt 120 

gactttatct gcattggcat tgtcgcccta ccgctaacga tggtgattgt cagtggcggg 180 

atcgatattt cgtttggttc gaccatcggc ctctgcgcca ttgcattggg cgtactgttt 240 

caaagtggtg tgccgatgcc gctggcgata ctcctgacct tactgctcgg cgcattgtgc 300 

gggctgatca acgccggatt aattatctat accaaagtta acccgctggt gattacgctt 360 

ggcacgctgt atctgtttgc cggaagcgct ctgctgcttt ccggtatggc cggagcgacg 420 

gggtacgaag gtattggtgg attcccgatg gcgtttacag atttcgctaa cctggatgtg 480 

ctgggactcc ccgttccgct gattatcttc ctgatatgtc tcctcgtttt ctggctctgg 540 

ctgcataaaa cccatgccgg acgtaatgtg tttttgattg ggcaaagccc gcgcgtggcg 600 

ctttatagcg cgattccagt taaccgtacc ttatgtgcgc tctatgccat gacggggctg 660 

gcgtctgcgg tcgccgctgt gctgctggta tcgtattttg gttcagcacg ttccgatctc 720 

ggtgcgtcgt ttctgatgcc cgccatcacc gccgtggtgc ttggcggggc caatatttat 780 

ggtggttccg gttccattat cggcaccgcc attgcggttt tattagtggg atatttgcaa 840 

caaggtttgc aaatggcagg agtgccaaat caggtgtcca gcgccctttc cggtgcgcta 900 

cttatcgtcg ttgtcgtagg tcgttccgtt agcctgcatc gccagcaaat taaagagtgg 960 

ctggcgcgtc gggccaataa cccattgcca taa " ' 993 

<210> 102 
<211> 1023 
<212> DNA 
<213> E. Coli 



<400> 102 

atgacacttc atcgctttaa gaaaatcgcc ttacttagcg ctcttggcat tgccgcaatc 60 

tctatgaatg tgcaggccgc agagcgtatt gcatttattc ccaaactggt tggcgtggga 120 

ttttttacca gcggtggcaa cggcgcacaa caagcgggta aagagctggg cgttgatgtg 180 

acctacgacg ggccgacaga acccagtgtt tctggtcagg tacagttgat taataacttc 240 

gtcaatcaag gttataacgc cattatcgtt tctgcggttt cgcctgatgg cttgtgtccg 300 

gcactgaaac gcgccatgca acgtggtgtg agagtgctga cctgggactc tgatactaaa 360 

ccggagtgcc gctcttacta cattaatcag ggaacgcccg cccagttagg aggtatgttg 420 

gtggatatgg cggcgcgtca ggtgaataaa gacaaagcca aagtcgcgtt tttctactca 480 

agccccaccg ttacggacca aaaccagtgg gtgaaagaag cgaaagcgaa aatcgccaaa 540 

gagcatccag gctgggaaat tgtcactacg cagtttggct ataacgatgc cactaaatcg 600 

ttacaaaccg cagaaggaat attaaaagcg tatagcgatc tcgacgccat tatcgccccc 660 

gatgccaacg ccctgcccgc tgccgcacaa gccgcagaaa acttgaaaaa tgacaaagta 720 

gcgattgtcg gattcagtac gccaaatgtg atgcgcccgt atgtagagcg cggcacggtg 780 

aaagaatttg gcctgtggga tgtggttcag caaggcaaaa tttcagtgta tgtcgcggat 840 

gcattattga aaaaaggatc aatgaaaacg ggcgacaagc tggatatcaa gggcgtaggt 900 

caggttgaag tctcgccaaa cagcgttcag ggctatgact acgaagcgga tggtaatggc 960 

atcgtactgt taccggagcg cgtgatattc aacaaagaga atatcggcaa atacgatttc 1020 

tga 1023 

<210> 103 
<211> 876 
<212> DNA 
<213> E. Coli 



<400> 103 

atggcagatt tagacgatat taaagatggt aaagattttc gtaccgatca accgcaaaaa 60 

aatatccctt ttaccctgaa aggttgcggt gcgctggatt ggggaatgca gtcacgctta 120 

tcgcggatat ttaatccgaa aacgggtaaa accgtgatgc tggcttttga ccatggttat 180 

tttcagggac cgactaccgg acttgaacgc attgatataa atatcgcccc gctgtttgaa 240 

catgccgatg tattaatgtg tacgcgcggc attttgcgca gcgtagttcc ccctgcgacc 300 

aataggccgg tggtactgcg ggcgtcaggt gcgaactcta ttctggcgga attaagtaat 360 

gaagccgtgg cgttatcgat ggatgacgcc gtgcgcctga acagttgcgc ggtggcggcg 420 

caggtttata tcggcagcga atatgaacat cagtcgatca aaaatattat tcagctggtt 480 
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gatgccggaa tgaaagtggg aatgccgacc atggccgtga ctggcgtggg caaagatatg 540 

gtgcgcgatc agcgttattt ctcgctcgcg actcgaatcg ccgctgaaat gggggcgcaa 600 

attatcaaaa cctattatgt cgaaaaaggt tttgaacgga ttgttgccgg atgtccggta 660 

cccattgtta ttgctggcgg taaaaaatta ccggagcgcg aggcgctgga aatgtgctgg 720 

caggctatcg atcagggcgc ttctggtgtg gatatggggc gtaatatttt ccagtctgac 780 

catccggtgg cgatgatgaa agccgtacag gcggtggttc accataacga aacggctgat 840 

cgggcatatg aactctatct gagtgaaaaa cagtaa 37^ 

<210> 104 
<211> 291 
<212> DNA 
<213> E. Coli 



<400> 104 

atgcacgtca cactggttga aattaacgtt catgaagaca aggttgacga gtttatcgaa 60 

gtttttcgcc agaaccacct gggctctgta caggaagaag gcaatttgcg cttcgatgtc 120 

ttacaggacc cggaagtgaa ttcgcgcttt tatatctacg aagcctataa agatgaagac 180 

gcagtggcgt tccataaaac cacgccccac tacaaaacct gtgtcgcgaa actggaatct 240 

ttaatgaccg ggccgcgtaa aaaacgtctg ttcaatggtt tgatgccgtg a 291 

<210> 105 
<211> 1152 
<212> DNA 
<213> E . Coli 



<400> 105 

atgtttgaac caatggaact taccaatgac gcggtgatta aagtcatcgg cgtcggcggc 60 

ggcggcggta atgctgttga acacatggtg cgcgagcgca ttgaaggtgt tgaattcttc 120 

gcggtaaata ccgatgcaca agcgctgcgt aaaacagcgg ttggacagac gattcaaatc 180 

ggtagcggta tcaccaaagg actgggcgct ggcgctaatc cagaagttgg ccgcaatgcg 240 

gctgatgagg atcgcgatgc attgcgtgcg gcgctggaag gtgcagacat ggtctttatt 300 

gctgcgggta tgggtggtgg taccggtaca ggtgcagcac cagtcgtcgc tgaagtggca 360 

aaagatttgg gtatcctgac cgttgctgtc gtcactaagc ctttcaactt tgaaggcaag 420 

aagcgtatgg cattcgcgga gcaggggatc actgaactgt ccaagcatgt ggactctctg 480 

atcactatcc cgaacgacaa actgctgaaa gttctgggcc gcggtatctc cctgctggat 54 0 

gcgtttggcg cagcgaacga tgtactgaaa ggcgctgtgc aaggtatcgc tgaactgatt 600 

actcgtccgg gtttgatgaa cgtggacttt gcagacgtac gcaccgtaat gtctgagatg 660 

ggctacgcaa tgatgggttc tggcgtggcg agcggtgaag accgtgcgga agaagctgct 720 

gaaatggcta tctcttctcc gctgctggaa gatatcgacc tgtctggcgc gcgcggcgtg 780 

ctggttaaca tcacggcggg cttcgacctg cgtctggatg agttcgaaac ggtaggtaac 840 

accatccgtg catttgcttc cgacaacgcg actgtggtta tcggtacttc tcttgacccg 900 

gatatgaatg acgagctgcg cgtaaccgtt gttgcgacag gtatcggcat ggacaaacgt 960 

cctgaaatca ctctggtgac caataagcag gttcagcagc cagtgatgga tcgctaccag 1020 

cagcatggga tggctccgct gacccaggag cagaagccgg ttgctaaagt cgtgaatgac 1080 

aatgcgccgc aaactgcgaa agagccggat tatctggata tcccagcatt cctgcgtaag 1140 

caagctgatt aa ~ " 1152 

<210> 106 
<211> 3048 
<212> DNA 
<213> E. Coli 



<400> 106 

atggacgtca gtcgcagaca attttttaaa atctgcgcgg gcggtatggc tggaacaaca 60 

gtagcggcat tgggctttgc cccgaagcaa gcactggctc aggcgcgaaa ctacaaatta 120 

ttacgcgcta aagagatccg taacacctgc acatactgtt ccgtaggttg cgggctattg 180 

atgtatagcc tgggtgatgg cgcaaaaaac gccagagaag cgatttatca cattgaaggt 240 

gacccggatc atccggtaag ccgtggtgcg ctgtgcccga aaggggccgg tttgctggat 300 

tacgtcaaca gtgaaaaccg tctgcgctac ccggaatatc gtgcgccagg ttctgacaaa 360 

tggcagcgca ttagctggga agaagcattc tcccgtattg cgaagctgat gaaagctgac 420 

cgtgacgcta actttattga aaagaacgag cagggcgtaa cggtaaaccg ttggctttct 480 
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accggtatgc tgtgtgcctc cggtgccagc aacgaaaccg ggatgctgac ccagaaattt 540 

gcccgctccc tcgggatgct ggcggtagac aaccaggcgc gcgtctgaca cggaccaacg 600 

gtagcaagtc ttgctccaac atttggtcgc ggtgcgatga ccaaccactg ggtggatatc 660 

aaaaacgcta acgtcgtgat ggtgatgggc ggtaacgctg ctgaagcgca tcccgtcggt 720 

ttccgctggg cgatggaagc gaaaaacaac aacgacgcaa ccttgatcgt tgtcgatccc 780 

cgttttacgc gtaccgcttc tgtggcggat atttacgcgc ctattcgttc cggtacggac 840 

attacgttcc tgtctggcgt tttgcgctac ctgatcgaaa acaacaaaat caacgccgaa 900 

tacgttaagc attacaccaa cgccagcctg ctggtgcgtg atgattttgc tttcgaagac 960 

ggtctgttca gcggctacga cgctgaaaaa cgtcaatacg ataaatcgtc ctggaactat 1020 

cagctcgatg aaaacggcta tgcgaaacgc gatgaaacac tgactcatcc gcgctgtgtg 1080 

tggaacctgc tgaaagagca cgtttcccgc tacacgccgg acgtcgttga aaacatctgc 1140 

ggtacgccaa aagccgactt cctgaaagtg tgtgaagtgc tggcctccac cagcgcaccg 1200 

gatcgcacaa ccaccttcct gtacgcgctg ggctggacgc agcacactgt gggtgcgcag 1260 

aacatccgta ctatggcgat gatccagttg ctgctcggta acatgggtat ggccggtggc 1320 

ggcgtgaacg cattgcgtgg tcactccaac attcagggct tgactgactt aggcctgctc 1380 

tctaccagcc tgccaggtta tctgacgctg ccgtcagaaa aacaggttga tttgcagtcg 1440 

tatctggaag cgaacacgcc gaaagcgacg ctggctgatc aggtgaacta ctggagcaac 1500 

tatccgaagt tcttcgttag cctgatgaaa tctttctatg gcgatgccgc gcagaaagag 1560 

aacaactggg gctatgactg gctgccgaag tgggaccaga cctacgacgt catcaagtat 1620 

ttcaacatga tggatgaagg caaagtcacc ggttatttct gccagggctt taacccggtt 1680 

gcgtccttcc cggacaaaaa caaagtggtg agctgcctga gcaagctgaa gtacatggtg 1740 

gttatcgatc cgctggtgac tgaaacctct accttctggc agaaccacgg tgagtcgaac 1800 

gatgtcgatc cggcgtctat tcagactgaa gtattccgtc tgccttcgac ctgctttgct 1860 

gaagaagatg gttctatcgc taactccggt cgctggttgc agtggcactg gaaaggtcag 1920 

gacgcgccgg gcgaagcgcg taacgacggt gaaattctgg cgggtatcta ccatcatctg 1980 

cgcgagctgt accagtccga aggtggtaaa ggcgtagaac cgctgatgaa gatgagctgg 2040 

aactacaagc agccgcacga accgcaatct gacgaagtgg ctaaagagaa caacggctac 2100 

gcgctggaag atctctatga cgctaatggc gtgcttattg cgaagaaagg tcagttgctg 2160 

agtagctttg cgcatctgcg tgatgacggt acaaccgcat cttcttgctg gatctacacc 2220 

ggtagctgga cagagcaggg caaccagatg gctaaccgcg ataactccga cccgtccggt 2280 

ctggggaata cgctgggatg ggcctgggcg tggccgctca accgtcgcgt gctgtacaac 2340 

cgtgcttcgg cggatatcaa cggtaaaccg tgggatccga aacggatgct gatccagtgg 2400 

aacggcagca agtggacggg taacgatatt cctgacttcg gcaatgccgc accgggtacg 2460 

ccaaccgggc cgtttatcat gcagccggaa gggatgggac gcctgtttgc catcaacaaa 2520 

atggcggaag gtccgttccc ggaacactac gagccgattg aaacgccgct gggcactaac 2580 

ccgctgcatc cgaacgtggt gtctaacccg gttgttcgtc tgtatgaaca agacgcgctg 2640 

cggatgggta aaaaagagca gttcccgtat gtgggtacga cctatcgtct gaccgagcac 2700 

ttccacacct ggaccaagca cgcattgctc aacgcaattg ctcagccgga acagtttgtg 2760 

gaaatcagcg aaacgctggc ggcggcgaaa ggcattaata atggcgatcg tgtcactgtc 2820 

tccagcaagc gtggctttat ccgcgcggtg gctgtggtaa cgcgtcgtct gaaaccgctg 2880 

aatgtaaatg gtcagcaggt tgaaacggtg ggtattccaa tccactgggg ctttgagggt 2940 

gtcgcgcgta aaggttatat cgctaacact ctgacgccga atgtcggtga tgcaaactcg 3000 

caaacgccgg aatataaagc gttcttagtc aacatcgaga aggcgtaa 3048 

<210> 107 
<211> 885 
<212> DNA 
<213> E. Coli 

<400> 107 

atggctatgg aaacgcagga cattatcaaa aggtccgcaa ctaactccat cacgccgcct 60 

tctcaggtgc gtgattacaa agcagaagtc gcaaaactta tcgacgtttc cacctgtatc 120 

ggctgtaaag cctgtcaggt ggcgtgttcg gagtggaacg acatccgtga tgaagtgggg 180 

cactgcgtcg gggtttacga taaccccgcc gatctgagcg ccaagtcctg gacggtgatg 240 

cgctttagcg aaaccgaaca gaacggcaag ctggagtggc tgatccgtaa agacggctgt 300 

atgcactgtg aagatcccgg ctgcctgaag gcgtgcccgt ctgctggtgc aatcattcag 360 

tacgctaacg ggattgtcga tttccagtcg gaaaactgca tcggctgtgg ttactgcatt 420 

gccgggtgtc cgtttaatat tccgcgcctc aacaaagagg ataaccgggt atataaatgc 480 

acgctctgcg tcgatcgcgt cagcgtcggc caggaaccgg cttgtgtgaa aacctgtccg 540 

accggggcta tccacttcgg caccaagaag gagatgctgg agctggcgga acagcgcgtg 600 

gcgaaactga aagcgcgtgg ttacgaacat gctggcgtct acaacccgga aggggtcggt 660 
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ggtacgcacg ttatgtacgt gctgcatcac gccgatcagc cggagctgta tcacggtctg 720 

ccgaaagatc cgaagatcga cacctcggta agcctgtgga aaggcgcgtt gaaaccgctg 780 

gcagcggctg gctttattgc cacttttgcc gggttgattt tccactacat cggtattggc 840 

ccgaataagg aagtggacga tgacgaggag gatcatcatg agtaa ~ 885 

<210> 108 
<21I> 654 
<212> DNA 
<213> E . Coli 



<400> 108 

atgagtaagt cgaaaatgat tgtgcgcacc aaatttattg atcgcgcctg tcactggacc 60 

gtggtgattt gcttcttcct ggtggcgctg tccgggattt cgttcttctt cccgacgctg 120 

caatggctga cgcaaacctt cggtacgccg cagatgggac gcattttgca cccgttcttc 180 

ggcattgcga ttttcgtcgc actgatgttt atgtttgtgc gttttgtgca tcacaacatc 240 

ccggataaga aagatattcc gtggctgttg aacattgtcg aagtattgaa aggcaatgag 300 

cataaagtgg cggatgtcgg taagtacaac gccgggcaaa agatgatgtt ctggtcgatc 360 

atgagcatga ttttcgtgct gctggtgacc ggggtgatta tctggcgtcc gtactttgcg 420 

cagtacttcc cgatgcaggt tgttcgctac agcctgctga tccacgcggc tgcgggtatc 480 

atcctgatcc acgccatcct gatccatatg tatatggcat tttgggtgaa aggatcgatt 540 

aaagggatga tcgaagggaa ggtaagtcgt cgctgggcga agaaacacca tccgcgctgg 600 

tatcgtgaaa tcgagaaggc agaagcgaaa aaagagagtg aagaagggat ataa 654 

<210> 109 
<211> 261 
<212> DNA 
<213> E. Coli 



<400> 109 

atggcgttgt taatcactaa aaaatgcatc aattgtgata tgtgtgaacc cgaatgcccg 60 

aatgaggcga tttcaatggg agatcatatc tacgagatta acagcgataa gtgtaccgaa 120 

tgcgtagggc actacgagac accaacctgc cagaaggtgt gcccgatccc caatactatt 180 

gtgaaagatc cggcgcatgt cgagacagaa gaacagttgt gggataaatt tgtgctgatg 240 
caccacgcgg ataaaattta a - - ~ 

<210> 110 
<211> 1203 
<212> DNA 
<213> E. Coli 



<400> 110 

atgcaaagtg ttgatgtagc cattgttggc ggcggcatgg tggggctggc ggttgcctgt 60 

ggcttacagg ggagcggctt acgcgttgcc gtactggagc agcgcgtaca ggaacctctg 120 

gcggcgaatg caccaccaca actgcgcgtt tcggctatca atgccgccag cgaaaaatta 180 

ctcacccgtc ttggcgtctg gcaggacatt ctctctcgta gggccagctg ttatcacggt 24 0 

atggaagtgt gggacaaaga cagctttggt cacatttcgt ttgacgatca aagcatgggc 300 

tatagccatc ttgggcatat cgttgaaaat tcagtgattc actacgcgct gtggaacaaa 360 

gcgcatcagt cgtcagatat cactctgtta gcccccgcag aattacagca ggtcgcctgg 420 

ggagaaaatg aaaccttcct gacgctgaaa gatggcagca tgttaacggc gcgtctggtg 480 

attggcgcgg acggcgctaa ttcctggttg cgcaacaaag ccgatattcc gctgactttc 540 

tgggattatc agcatcacgc gctggtagcg accattcgca cggaagaacc gcatgatgcg 600 

gtggcgcggc aggttttcca tggcgaaggc attctggcct ttttaccgct tagcgatccg 660 

catctttgct cgattgtctg gtcactgtcg ccagaggaag cgcagcggat gcagcaggca 720 

agtgaagacg aatttaatcg cgcgttaaat atcgcttttg ataatcgcct gggcttatgc 780 

aaggttgaga gcgcgcgtca ggtgttccca ctgacggggc gttatgcgcg ccagtttgcc 840 

tcgcaccgtc tggcgctggt gggcgacgcc gcacatacca ttcacccgct ggcggggcag 900 

ggggtaaatc tcggctttat ggatgctgca gagctgattg ccgaactgaa acggttgcat 960 

cgtcagggga aagacatcgg gcagtacatt tatctgcgtc gctatgagcg tagccgcaag 1020 

cacagtgcgg cgttgatgct ggctggtatg cagggattcc gcgatctgtt ttccggtacc 1080 

aatccggcga aaaaactgct gcgtgatatt ggtttgaaac tggccgacac gcttcctggc 1140 

gttaagccgc aacttatccg ccaggcaatg ggattaaacg atttgcctga atggctgcgt 1200 
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taa 



1203 



<210> 111 
<211> 1179 
<212> DNA 
<213> E. Coli 

<400> 111 

atgagcgtaa tcatcgtcgg tggcggcatg gcgggcgcga cgctggcgct ggctatttcc 60 

cggttaagtc acggggcgct gccggtacat ttgattgaag cgactgcgcc agagtcacat 120 

gctcatccgg gctttgatgg acgagcgata gcgctggcgg cgggtacctg tcagcaactg 180 

gcgcgcatcg gcgtctggca atctctggcg gattgcgcaa ctgccatcac caccgtgcat 240 

gtcagcgatc gtggtcacgc tggatttgtc accctcgccg cagaagatta ccaactggcg 300 

gcgctgggac aggttgtcga attgcacaat gtcgggcaac ggctgtttgc attgctgcgt 360 

aaagcacctg gcgtaacgct gcattgccct gatcgcgtgg ctaacgttgc ccgtactcag 420 

agtcacgttg aagtgacgct ggagagtggc gagacgctga cgggccgcgt gctggtagca 480 

gctgatggca cccattcagc gttagccacc gcgtgcggcg ttgactggca gcaggagcct 540 

tacgaacaac tggccgtgat tgccaacgtt gctacttccg ttgcgcatga agggcgcgct 600 

tttgaacgct ttacgcaaca tggcccgctg gcgatgttgc cgatgtctga cggacgctgt 660 

tcgctggtct ggtgtcatcc actggaacgg cgcgaagagg tgttgtcgtg gagtgacgag 720 

aagttttgcc gtgaactcca gtcggccttt ggctggcgac ttgggaaaat tacccacgct 780 

ggtaaacgca gtgcttatcc gctggcgtta acccacgccg ccagatctat tacccatcgt 840 

accgtgctgg tgggcaatgc ggcgcaaact ctgcacccga ttgccgggca agggtttaac 900 

ctcggtatgc gagatgtgat gagtcttgcg gaaaccctga ctcaggcgca ggagcgcgga 960 

gaagacatgg gggattacgg cgtattgtgc cgttatcagc agcgtcgaca gagcgatcgc 1020 

gaagcaacca ttggcgtcac ggacagcctt gtacatcttt ttgccaaccg ttgggcaccg 1080 

ctggttgtcg ggcgcaacat cgggctgatg acgatggaat tattcacccc ggcacgcgat 1140 

gtgctggcgc agcgcaccct cggttgggtg gcgcgttga 1179 

<210> 112 
<211> 1326 
<212> DNA 
<213> E. Coli 



<400> 112 

atgagtgaga tatcccggca agagtttcag cgtcgccgtc aggccctggt ggagcaaatg 60 

caacccggca gcgccgcgct gatttttgct gcaccagaag taacacgtag cgccgacagc 120 

gaatacccct atcgtcagaa cagtgacttc tggtacttca ccggctttaa cgaaccggaa 180 

gcggtgctgg tgctgattaa aagcgatgac actcataacc acagcgttct gtttaaccgc 240 

gttcgcgacc tgacggcgga gatctggttt ggccgtcgct taggccagga tgccgcgcca 300 

gagaaactgg gcgttgaccg cgcactggca ttcagcgaaa tcaatcagca actttatcaa 360 

ctacttaacg gcctggatgt ggtttaccat gcccagggcg aatatgcata tgctgatgta 420 

atcgtgaaca gtgcgctgga aaaactgcgt aaaggttcgc ggcaaaatct caccgcaccg 480 

gcaacgatga tcgactggcg tcctgttgtt catgaaatgc gcctgttcaa atcgccagaa 54 0 

gagattgccg tactccgccg cgcgggagaa atcaccgcca tggcacatac acgggcgatg 600 

gaaaaatgcc gtccgggaat gttcgagtac catctggaag gcgaaattca ccacgaattt 660 

aaccgccacg gtgcgcgcta tccgtcctat aacaccattg tcggcagcgg tgaaaacggc 720 

tgcattctgc actacaccga aaacgagtgt gaaatgcgcg acggcgacct ggtgttgatt 780 

gacgcgggtt gtgaatacaa aggttacgct ggcgatatta cccgcacctt cccggtcaac 840 

ggcaaattca cccaggccca gcgtgaaatc tacgacattg tgctggagtc tctcgaaacc 900 

agcctgcgcc tgtatcgtcc gggaacttcc attctggaag tcactggtga agtggtgcgc 960 

atcatggtta gcggcctggt aaaactcggc atcctgaaag gtgatgttga tgaactgatc 1020 

gctcagaacg cccatcgtcc tttctttatg catggcctta gccactggtt aggactggat 1080 

gtccatgacg tgggtgttta tggtcaggat cgctcgcgca ttctggaacc gggcatggta 1140 

ctgaccgtag agccagggct gtatattgcg ccggatgcag aagtgccaga acaatatcgc 1200 

ggtatcggca ttcgtattga agacgacatt gtgattaccg aaaccggtaa cgaaaacctc 1260 

accgccagcg tggtgaaaaa gccggaagaa atcgaagcgt tgatggttgc tgcgagaaag 1320 

caatga 1326 

<210> 113 
<211> 585 
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<212> DNA 
<213> E. Coli 

<400> 113 

atgcttatgt ctatacagaa cgaaatgcct ggttacaacg 

caacaaggga cgggtctgac cccagctgag atgcatggtt 

ggcggtaacg atgacagctc atggctaccg ctacttcacg 

gctttcggtc atgagctggc acaggcactg cgtaaaatgc 

ctgcaggatg acggcttcct ttttcagctt tatctgcctg 

ttcgatcggg ctgatgcatt ggcaggttgg gtcaatcact 

acgcaaccga agctggataa agtgaccggc gaaaccggtg 

aacattgcgc aactgggtta cgacgaagac gaagatcagg 

gaagagatca tcgaatacgt tcgtgttgcc gcgctgttat 

ccgcaaccga ccgcgccaga agtacaaaaa ccgactctac 

<210> 114 
<211> 363 
<212> DNA 
<213> E. Coli 

<400> 114 

atgttaaagc tatttgcaaa gtacacctct attggtgtgc 
gtggtttttg gtgtttgtat ctatgtcgcg catacaaacc 
ggtttcgttg tggctgtgag ctttagcttc ttcgcgaatg 
tcgactacaa cgatgcgcta catgctatat gttgggttca 
gttggatggg ctgctgatag atgcgcactt cccccgatga 
gccatcagcc tggtgtgcgg tttcgtctat tcaaagttca 
tga 

<210> 115 
<211> 921 
<212> DNA 
<213> E. Coli 



aaatgaacca 
taatcagcgg 
acctgacgaa 
actctgccac 
atggcgatga 
tcctgcttgg 
aagctatcga 
aagagcttga 
gccacgacac 
actaa 



gtatctgaac 
gatgatatgt 
cgaaggcatg 
cagcgatgcc 
tgtcagcgtt 
tcttggcgtt 
cgatctgcgt 
aatgtcgctt 
ctttactcat 



tgaacaccct 
aagctcttgc 
caaaattcac 
tggggacact 
taactcttgt 
ttgtctttag 



tatacactgg 
aaacttcgca 
attcaaggca 
gagtgctact 
caccttctcc 
ggatgcgaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
585 



60 
120 
180 
240 
300 
360 
363 



<400 
atgaagatat 
aaaacggtac 
gacggcagca 
gttgttccgc 
gaccatgcaa 
gttattcctc 
tctgaccgct 
cacaataaaa 
cgtgatgttg 
ctgagctggg 
ggagatacaa 
ttttccacat 
tttatttatg 
tatccttcac 
gtattaggtg 
atcaagagag 



> 115 
ctcttgtagt 
gtgaattcga 
aagacgctac 
tgtcatttac 
ccggggatgc 
atcttattga 
caactgatgg 
taagcaatcc 
tcgaaaatat 
taggaggaaa 
aatttaatgg 
tccctcttcg 
gggcgtggat 
tacttgtttc 
aatatattgg 
tcaaaaaatg 



tcctgtcttc 
agaattgaag 
ggagtcaatc 
acgcaacttt 
gataatccca 
aaaatggcaa 
acgcctgaag 
taaaattgaa 
taaacttatg 
gacagatatt 
atggaaactt 
catctggaca 
gattttagat 
aatactgttt 
acgcacatac 



aatgaagaag 
tcatatgaag 
attaatgctc 
ggtaaagaac 
attgatgttg 
gcaggtgctg 
cgaaaaacgg 
gagaatgttg 
ccagaacgaa 
gttgaatacg 
tggaatttag 
tacatagggt 
actatcatat 
ttaggtggaa 
attgaaacca 



aagcgatacc 
tggaaatcgt 
tggctgtttc 
cagcattgtt 
acctgcaaga 
atatggttct 
ctgagtggtt 
gtgatttcag 
accttttcat 
tgcgagcgga 
cacttgaggg 
tagtggtagc 
ttggaaatgc 
ttcagatgat 
aaaaacgccc 



aattttttat 
tttcataaat 
agatcctcta 
tgcagggtta 
cccgattgag 
tgctaaaaga 
ctataagctc 
gctgatgagc 
gaaaggtatt 
aagaattgct 
tattacaagc 
cagtgtagca 
tgttagggga 
tggaatagga 
gaaatacatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
921 



<210> 116 
<211> 1332 

<212> DNA 

<213> E. Coli 

<400> 116 

atgaataaag caataaaagt atcattgtat atatcttttg ttttgattat ttgcgcctta 
tctaaaaaca taatgatgtt aaatacatct gatttcggaa gagccattaa gccattaatt 
gaagacatac cagcatttac atatgactta cctttattgt ataaattgaa aggtcatatt 



60 
120 
180 
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gattcaattg 
ctgtttatta 
gtaatatata 
gtaactttat 
atgtttgcat 
tcattaacat 
cctactgcta 
tttgatagac 
atatttgcaa 
ttcggtagtt 
aagtgtgttg 
acagaagttg 
tttttattgg 
tctcagtata 
tcaaacatac 
atatcattgt 
tctttatttg 
ggatatagag 
tacataacag 
gtaaagcact 



atagctatga 
gcatttttac 
tttattcatt 
ttacattctt 
cattctatca 
gcaaaaacaa 
aaaatcaatt 
acaaactaat 
tatcttattc 
atctttatat 
ggttagatgc 
gaacggaatg 
ttagcaaacc 
aagaaaatta 
taacgactat 
tattattttt 
tagtatctct 
atttaagcaa 
tcgttttttt 
aa 



gtatataagt 
tgaatatctt 
atatgcgata 
tattttagct 
agagcaaata 
taaatctatg 
tatattaacc 
tattaaatct 
aaaaggtgtt 
gaaaaacaac 
ctggggtaat 
tttcgaatct 
aagcaccatc 
tttccatgta 
tactaacata 
tattgcttct 
ttttggaata 
gcatttattt 
aatttataaa 



tcatatagtt 
gatgctaggg 
tttacttcat 
tttcttatgt 
gttataattt 
cttttgctat 
ccactaatag 
gtaatatgcg 
gttgaattaa 
gggtataaaa 
aaattcgaca 
cataaagatg 
ttcaaacttc 
tataaaaaac 
aaagacaata 
atttttatta 
tctcaatttt 
ggaatgtatt 
ataattcaaa 



atattttgta 
tgttatcgtt 
atataaaaac 
gttcttcatc 
tccttccatt 
ttttttcgtt 
tgtattcata 
tggtgtgctt 
ataagtacca 
tgccatcgta 
tatcatttgg 
aaacgttttc 
catttgatga 
tacacgtaat 
tatttaaaaa 
gaaataataa 
atgtgtcatt 
tttcgttcga 
gaaatcaaga 



tacatacgtc 
atttctaaaa 
agaaaggtat 
aacactgtca 
tttggtgtat 
gctaataata 
ttatattttt 
gcttgcgtca 
tgcaacatac 
tgttgatgat 
cgcaacccca 
gaatgcactc 
tggtgtgatg 
atatggagaa 
cattagattt 
aataaaggca 
tttcggggaa 
cctttgctta 
caatagcgat 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1332 



<210> 117 
<211> 249 
<212> DNA 
<213> E . Coli 



<400> 117 

atgggcattc tgtcatggat tatttttggg cttattgccg gtattctggc gaagtggatc 60 

atgccaggta aagatggagg tggattcttt atgactatcc tgctggggat agtcggtgcc 120 

gtagtcggcg gatggatcag cacgctgttt ggctttggta aagtcgatgg cttcaatttt 180 

ggcagcttcg tggttgccgt tattggtgcg attgtcgtgc tatttatcta caggaagatt 240 

aaaagttaa " 24 9 

<210> 118 
<211> 183 
<212> DNA 
<213> E. Coli 



<400> 118 

atgggcaaag caacgtatac cgtgaccgtc accaataaca gcaatggcgt ttctgtcgat 60 

tatgaaacag agacgccgat gactttgctg gtgccagaag tggcggctga agtgataaaa 120 

gatctggtga ataccgtacg ttcttatgac acggaaaacg aacatgatgt ttgtggttgg 180 



<210> 119 
<211> 360 
<212> DNA 
<213> E . Coli 



<400> 119 

atgcttcaaa tcccacagaa ttatattcat acgcgctcaa cgcctttctg gaataaacaa 60 

actgcacctg ccggaatatt cgaacgtcat cttgataaag gaacgcgccc gggggtttac 120 

ccacgccttt ccgttatgca tggggcggtc aaatatctcg gctacgctga tgaacacagt 180 

gcagagcctg atcaggtgat ccttatcgaa gcggggcagt ttgcggtgtt ccctccagaa 240 

aagtggcaca acattgaagc catgactgac gatacttatt tcaacattga cttcttcgtg 300 

gctcctgaag tcctgatgga aggtgcgcaa caacggaaag tcattcataa cgggaaatga 360 

<210> 120 
<211> 741 
<212> DNA 
<213> E . Coli 
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<400> 120 

gtgaagttca aagttatcgc cctggcggca ttaatgggta ttagcgggat ggcagcgcag 60 

gctaacgaat tgccggatgg accgcatatt gtcacctccg gtacggcaag cgtggatgcg 120 

gtgccagaca ttgccactct tgcgattgaa gttaacgtgg ccgcgaagga tgccgctact 180 

gccaagaaac aggcagatga gcgcgtcgca caatacattt ccttccttga actcaatcag 240 

atcgcgaaaa aagatatcag ctcagcgaac ttacgcaccc agccagatta tgattatcag 300 

gatggtaaaa gtatccttaa aggctaccgc gctgtgagaa cggtggaagt cacgctccgt 360 

cagttagaca aactgaattc cttgctggat ggcgcgctga aggcgggtct taacgaaatt 420 

cgttctgtgt cgctgggcgt ggcgcagccg gatgcctata aagacaaagc gcgtaaggca 480 

gcgattgata acgcgattca tcaggcgcag gaactggcga acggctttca tcgtaaactg 540 

gggccggtat atagcgtgcg ctaccatgtt tccaactatc agcccagccc aatggtgcgg 600 

atgatgaaag ccgatgccgc gccggtgtcc gcccaggaaa cttacgagca ggccgctatt 660 

cagtttgatg atcaggtcga tgtggtcttc cagttagaac ctgtggatca acaacccgct 720 

aaaacacctg cagcacaata a 741 

<210> 121 
<211> 1395 
<212> DNA 
<213> E. Coli 

<400> 121 

gtgttattac tggatgcgtg ctcgcaaatg tgcccgtcat tcagacgatt ccagacagtg 60 

tttcataatt cctccatttt tctcccttat tggctggcta cactagtatc attccgcgaa 120 

acgtttcagg aagagaaact cttaacgatg aaaggtagtt ataaatcccg ttgggtaatc 180 

gtaatcgtgg tggttatcgc cgccatcgcc gcattctggt tctggcaagg ccgcaatgac 240 

tcccggagtg cagccccagg ggcgacgaaa caagcgcagc aatcgccagc gggtggtcga 300 

cgtggtatgc gttccggccc attagccccg gttcaggcgg cgaccgccgt agaacaggca 360 

gttccgcgtt acctcaccgg gcttggcacc attaccgccg ctaataccgt tacggtgcgc 420 

agccgcgtgg acggccaact gatagcgtta catttccagg aaggccagca ggtcaaagca 480 

ggcgatttac tggcagaaat tgaccccagc cagttcaaag ttgcattagc acaagcccag 540 

ggccaactgg caaaagataa agccacgctt gccaacgccc gccgtgacct ggcgcgttat 600 

caacaactgg caaaaaccaa tctcgtttcc cgccaggagc tggatgccca acaggcgctg 660 

gtcagtgaaa ccgaaggcac cattaaggct gatgaagcaa gcgttgccag cgcgcagctg 720 

caactcgact ggagccggat taccgcacca gtcgatggtc gcgttggtct caagcaggtt 780 

gatgttggta accaaatctc cagtggtgat accaccggga tcgtggtgat cacccagacg 840 

catcctatcg atttagtctt taccctgccg gaaagcgata tcgctaccgt agtgcaggcg 900 

cagaaagccg gaaaaccgct ggtggtagaa gcctgggatc gcaccaactc gaagaaatta 960 

agtgaaggca cgctgttaag tctagataac caaatcgatg ccactaccgg tacgattaaa 1020 

gtgaaagcac gctttaataa tcaggatgat gcgctgtttc ccaatcagtt tgttaacgcg 1080 

cgcatgttag tcgacaccga acaaaacgcc gtagtgatcc caacagccgc cctgcaaatg 1140 

ggcaatgaag gccattttgt ctgggtgctg aatagcgaaa acaaggtcag caaacatctg 1200 

gtgacgccgg gcattcagga cagtcagaaa gtggtgatcc gtgcaggtat ttctgcgggc 1260 

gatcgcgtgg tgacagacgg cattgatcgc ctgaccgaag gggcgaaagt ggaagtggtg 1320 

gaagcccaga gcgccactac tccggaagag aaagccacca gccgcgaata cgcgaaaaaa 1380 

ggagcacgct cctga 1395 

<210> 122 
<211> 3123 
<212> DNA 
<213> E. Coli 

<400> 122 

atgcaggtgt tacccccgag cagcacaggc ggcccgtcgc gcctgtttat tatgcgtcct 60 
gtggccacca cgctgctgat ggtggcgatc ttactcgccg ggattatcgg ttatcgcgcc ' 120' 

ctgcccgttt cggcgctgcc ggaagtggac tatccgacca ttcaggtggt cacgctctac 180 

ccaggtgcca gcccggatgt catgacctct gccgttaccg cgccgctaga acgccagttc 240 

gggcagatgt ctggcctgaa acagatgtcg tcgcaaagtt ccggcggtgc gtcagttatc 300 

actttgcagt tccagctaac attaccgctc gatgtcgccg agcaggaagt gcaggccgcg 360 

attaacgctg cgaccaactt gttgccgagc gatctgccta acccgccggt ttacagcaaa 420 

gtgaacccgg cagatccgcc gatcatgacg ctcgccgtca cctcaaccgc catgccgatg 480 
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acgcaagtgg aagatatggt ggaaacccgc gtcgcgcaga aaatctcgca gatttccggc 540 

gtcggcctgg tgacgctttc cggcggtcag cgtccggctg ttcgcgtcaa acttaacgct 600 

caggcgattg ccgccctcgg cctgaccagc gaaaccgtgc gcaccgccat taccggcgct 660 

aacgttaact cggcaaaagg tagcctcgac ggcccttccc gtgcggtcac gctttccgcg 720 

aacgaccaga tgcaatccgc cgaagagtat cgccagctaa tcatcgccta ccagaacggc 780 

gcgccaattc gtctgggcga tgtcgcaact gtagagcaag gtgcaaaaaa cagctggctc 840 

ggcgcgtggg cgaacaaaga acaggccatt gtgatgaatg ttcagcgcca gcccggtgct 900 

aacattatct ccaccgccga cagcattcgg cagatgctgc cacagctcac tgagagtctg 960 

ccgaaatcgg tgaaggtgac agtgctttcc gatcgcacca ccaatatccg cgcatccgtc 1020 

gatgatactc agtttgaatt gatgatggct atcgcgctgg tagtcatgat tatctacctg 1080 

tttttgcgca atattccggc gaccatcatt cccggtgttg ctgtaccgct gtcgttaatc 1140 

ggcactttcg cggttatggt gtttctcgat ttttcaatca ataacctgac actgatggcg 1200 

ttaactatcg ccaccggatt cgtggtcgat gacgccatcg tggtgatcga aaacatttcc 1260 

cgctatatcg aaaaaggcga aaaaccgttg gcggcggcgc tcaagggcgc aggtgaaatc 1320 

ggctttacca ttatctcgct gaccttctca ctgattgcgg tgttgatccc actgctgttt 1380 

atgggcgata tcgtcgggcg actgttccgc gaatttgcta ttaccctggc ggtagcgatt 1440 

ttgatctcag cggtggtgtc gctgaccctg acaccgatga tgtgcgcgcg gatgctcagc 1500 

caggagtcgt tgcgtaaaca gaaccgcttc tcccgtgcct cggaaaaaat gttcgacagg 1560 

ataatcgccg cctatggtcg tggactggcg aaagtgctga atcatccgtg gctgacctta 1620 

agcgtggcac tcagcacgct gctgcttagc gtgctgctgt gggtgttcat tccgaaaggt 1680 

ttcttcccgg tacaggacaa tggcattatt cagggcactt tgcaggcacc gcaatccagc 1740 

tcctttgcca atatggccca gcgacaacgc caggtcgcgg acgtgatttt gcaggatccg 1800 

gcagtgcaaa gcctgacctc atttgttggc gttgatggca ctaacccgtc gctgaacagt 1860 
gcacgtttac aaatcaacct caaaccgttg gatgaacgtg atgatcgggt gcaaaaagtc 
atcgcccgtc tgcaaacggc ggtagataaa gtgccgggcg tcgatctctt cctgcaacca 
acgcaggatc tgactattga tactcaggtc agccgcaccc agtaccagtt taccttgcag 

gccacgtcac tggatgcgct cagtacctgg gtgccacagt tgatggaaaa actccagcaa 2100 

ctgccacagc tttctgatgt ctccagcgac tggcaggaca aagggctggt ggcgtatgtc 2160 

aatgttgatc gcgacagcgc cagccgtctg gggatcagca tggcggatgt cgataacgcc 2220 

ctgtacaacg cgtttggtca gcggctgatt tccactattt atactcaggc caaccagtat 2280 

cgcgtggtgc tggagcacaa caccgaaaat accccaggcc tcgcggcgct ggataccatt 2340 

cgcctgacca gcagcgacgg cggcgtggtg ccgctaagct caattgccaa aattgagcag 2400 

cgttttgcgc cgctctccat caaccatctg gatcagttcc cggtaacgac catctccttt 24 60 

aacgtgccgg ataactattc gctgggcgat gcggtgcagg cgattatgga caccgaaaag 2520 

acgctgaatc tgccggtgga tatcaccacg cagttccagg gcagcaccct cgccttccag 2580 

tcggcgctgg gcagcactgt ctggctgatt gtcgcggcgg tggtggcgat gtatatcgtg 2 64 0 

ctcggcattc tgtacgagag ctttattcac ccgatcacca ttctctcgac gctacccacc 2700 

gcaggggttg gcgcactgct ggcgttgctg attgctggta gcgaactgga tgtgattgcg 2760 

attatcggca ttattttgct gatcggtatc gtgaagaaga acgccatcat gatgatcgac 2820 

ttcgcgctgg ctgctgagcg cgagcaaggc atgtcgccgc gcgaggcaat ctaccaggct 2880 

tgtctgttgc gttttcgtcc gatcctgatg accactctgg cggctctgct tggcgcgctg 2940 

ccgctgatgt tgagtaccgg ggtcggcgcg gaactgcgtc gtccgttagg tatcggcatg 3000 

gtcggcggtc tgattgtcag ccaggtgctg acgctgttta ccacgccggt gatttatttg 3060 

ctgttcgacc gcctggcatt gtggaccaaa agccgctttg cccgtcatga agaggaggcg 3120 



taa 



1920 
1980 
2040 



3123 



<210> 123 
<211> 3078 
<212> DNA 
<213> E. Coli 

<400> 123 

gtgaagtttt ttgccctctt catttaccgc ccggtggcga cgattttact gtcggttgcc 60 

attaccctgt gcggcatact gggcttccgt atgctgccgg tcgccccgct gccgcaggtc 120 

gattttccgg tgattatcgt cagcgcctcg ctgcccggtg cgtcaccaga aacaatggcg 180 

tcttccgttg ccacgccgct ggagcgctca cttgggcgca ttgccggagt cagtgaaatg 240 

acctccagca gttcgctcgg cagcacgcgt attattttgc agtttgattt tgaccgggat 300 

atcaacggcg cagcgcgtga tgtgcaggcg gcgatcaacg ctgcacaaag tttgctgccc 360 

agtgggatgc ccagccgccc gacctatcgc aaagcgaacc cgtcggatgc gccaattatg 420 

atcctcacgc tgacgtccga tacttattcg cagggtgaac tgtacgattt cgcctcgacg 4 80 

cagctggctc cgacgatttc gcaaatcgac ggtgttggtg atgtcgatgt cggaggcagc 540 
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tcactgcccg ccgtacgcgt cgggctgaat ccgcaggcgc tgtttaatca gggcgtgtcg 600 

ctggacgacg tacgcaccgc cgtcagcaat gccaacgtgc gtaaaccgca gggcgcgctg 660 

gaagatggca ctcaccgctg gcagatccag accaatgatg agctaaaaac cgccgctgaa 720 

tatcagccgt tgattattca ctacaacaac ggcggcgcgg ttcgtctggg cgatgtggcg 780 

acggtgaccg actcagtgca ggatgtgcgc aacgccggga tgaccaacgc caaaccggct 840 

attttactga tgatccgcaa actgccggaa gccaatatta tccagacggt tgacagcatc 900 

cgggcaaaat taccggagtt gcaggaaacc attccggcgg cgattgatct gcaaattgcc 960 

caggatcgct cccccaccat tcgcgcctcg ctggaagaag tcgagcaaac gctgattatc 1020 

tcggtggcgc tggtgattct ggtggtgttt ttattcctgc gctcgggtcg cgccactatt 1080 

attcccgccg tttcggtgcc ggtttcgctg attggtacgt ttgcggcgat gtacctgtgc 1140 

ggattcagtc tcaataacct ttcgttaatg gcgctcacca tcgctactgg tttcgtggtg 1200 

gatgacgcca tcgtggtgct ggaaaacatt gcacgtcatc tggaagcggg aatgaaaccg 1260 

ttgcaagccg cactgcaagg tactcgcgaa gtcggtttta cggtgctgtc gatgagtctg 1320 

tcactggtgg cggtgttcct gccgctgctg ttgatgggcg gattgccggg ccgactgtta 1380 

cgcgaatttg ccgtgacgct ttctgtcgcc attggtatat cgttgctggt ttctctgaca 1440 

ttaacgccaa tgatgtgtgg ctggatgctg aaagccagca agccgcgcga gcaaaagcga 1500 

ctgcgtggtt ttggtcgcat gttggtagcc ctgcaacaag gctacggcaa gtcactaaaa 1560 

tgggtgctca atcatacccg tctggtgggc gtggtgctgc ttggcaccat tgcgctgaat 1620 

atctggctgt atatctcgat cccgaaaacc ttcttcccgg agcaggacac tggcgtgttg 1680 

atgggcggga ttcaggcgga tcagagtatt tcgtttcagg cgatgcgcgg taagttgcag 1740 

gatttcatga aaattatccg tgacgatccg gcagtggata atgtcaccgg ctttacaggc 1800 

ggttcgcgag tgaacagcgg gatgatgttt atcaccctca agccacgcga cgaacgcagc 1860 

gaaacggcgc agcaaattat cgaccgtctg cgcgtaaaac tggcgaaaga accgggggcg 1920 

aatctgttcc tgatggcggt acaggatatt cgcgttggtg ggcgtcagtc gaacgccagc 1980 

taccagtaca cgttgttatc cgacgacctg gcggcactgc gagaatggga gccgaaaatc 2040 

cgcaaaaaac tggcgacgtt gccggaactg gcggacgtga actccgatca gcaggataac 2100 

ggcgcggaga tgaatctggt ttacgaccgc gacaccatgg cacggctggg aatcgacgta 2160 

caagccgcca acagtctgtt aaataacgcc ttcggtcagc ggcaaatctc gaccatttac 2220 

cagccgatga accagtataa agtggtgatg gaagtggatc cgcgctatac ccaggacatc 2280 

agtgcgctgg aaaaaatgtt cgttatcaat aacgaaggca aagcgatccc gctgtcgtat 234 0 

ttcgctaaat ggcaaccggc gaatgcccca ctatcggtga atcatcaggg attatcggcg 2400 

gcctcgacca tttcgtttaa cctgccgacc ggaaaatcgc tctcggacgc cagtgcggcg 24 60 

atcgatcgcg caatgaccca gcttggtgtg ccttcgacgg tgcgcggcag ttttgccggc 2520 

acggcgcagg tgttccagga gacgatgaac tcgcaggtga tcctgattat cgccgccatc 2580 

gccacggtgt atatcgtgct gggtatcctt tacgagagtt acgtacatcc gctgacgatt 2640 

ctctccaccc tgccctcggc gggcgttgga gcgctgttgg cgctggagct gttcaatgcc 2700 

ccgttcagcc taatcgccct gatagggatc atgctattaa tcggcatcgt gaagaaaaac 2760 

gccattatga tggtcgattt tgcgcttgaa gcccaacggc acggtaacct gacgccgcag 2820 

gaagctattt tccaggcctg tctgctgcgt tttcgcccga ttatgatgac taccctggcg 2880 

gcgctgtttg gtgcgctgcc gctggtattg tcgggcggcg acggctcgga gctgcggcaa 2940 

cccctgggga tcaccattgt cggcggactg gtaatgagcc agctccttac gctgtatacc 3000 

acgccggtgg tgtatctctt tttcgaccgt ctgcggctgc gtttttcgcg taaacctaaa 3060 

caaacggtaa ccgagtaa 3078 

<210> 124 
<211> 1416 
<212> DNA 
<213> E. Coli 

<400> 124 

atgacagatc ttcccgacag cacccgttgg caattgtgga ttgtggcttt cggcttcttt 60 

atgcagtcgc tggacaccac catcgtaaac accgcccttc cctcaatggc gcaaagcctc 120 

ggggaaagtc cgttgcatat gcacatggtc attgtctctt atgtgctgac cgtggcggtg 180 

atgctgcccg ccagcggctg gctggcggac aaagtcggcg tgcgcaatat tttctttacc 240 

gccatcgtgc tgtttactct cggttcactg ttttgcgcgc tttccggcac gctgaacgaa 300 

ctgttgctgg cacgcgcgtt acagggcgtt ggcggcgcga tgatggtgcc ggtcggcaga 360 

ttgacggtga tgaaaatcgt accgcgcgag caatatatgg cggcgatgac ctttgtcacg 420 

ttacccggtc aggtcggtcc gctgctcggt ccggcgctcg gcggtctgct ggtggagtac 480 

gcatcgtggc actggatctt tttgatcaac attccggtgg ggattatcgg tgcgatcgcc 540 

acattgctgt taatgccgaa ctacaccatg cagacgcggc gctttgatct ctccggattt 600 

ttattgctgg cggttggcat ggcggtatta accctggcgc tggacggcag taaaggtaca 660 
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ggtttatcgc cgctgacgat tgcaggcctg gtcgcagttg gcgtggtggc actggtgctt 720 

tatctgctgc acgccagaaa taacaaccgt gccctgttca gtctgaaact gttccgtact 780 

cgtacctttt cgctgggcct ggcggggagc tttgccggac gtattggcag tggcatgttg 840 

ccctttatga caccggtttt cctgcaaatt ggcctcggtt tctcgccgtt tcatgccgga 900 

ctgatgatga tcccgatggt gcttggcagc atgggaatga agcgaattgt ggtacaggtg 960 

gtgaatcgct ttggttatcg tcgggtactg gtagcgacca cgctgggtct gtcgctggtc 1020 

accctgttgt ttatgactac cgccctgctg ggctggtac^ acgttttgcc gttcgtcctg 1080 

tttttacaag ggatggtcaa ctcgacgcgt ttctcctcca tgaacaccct gacgctgaaa 1140 

gatctcccgg acaatctggc gagcagcggc aacagcctgc tgtcgatgat tatgcaattg 1200 

tcgatgagta tcggcgtcac tatcgccggg ctgttgctgg gactttttgg ttcacagcat 1260 

gtcagcgtcg acagcggcac cacacaaacc gtctttatgt acacctggct tagcatggcg 1320 

ttgatcatcg cccttccggc gttcatcttt gccagagtgc cgaacgatac gcatcaaaat 1380 

gtagctattt cgcggcgaaa aaggagcgcg caatga 1416 



<210> 125 
<211> 1035 
<212> DNA 
<213> E. Coli 



<400> 125 

atggaaattc gcataatgct atttatatta atgatgatgg ttatgcctgt gagctatgcg 60 

gcatgttata gtgagttatc tgttcagcac aacttggttg ttcaggggga ttttgcactt 120 

actcaaacac aaatggcgac atatgagcat aattttaatg attcgtcatg cgtaagtaca 180 

aatactatca cccctatgag cccgtcggat attattgttg gactttataa cgataccata 240 

aaattaaatt tacattttga atggaccaat aaaaacaaca tcacgttgtc aaataatcag 300 

accagtttca ccagtggtta ttcagttacg gtgacacctg cggccagtaa tgcaaaagtg 360 

aatgtttctg cggggggcgg cggttcagtg atgattaatg gtgttgcgac attatccagt 4 20 

gcttcatcat cgacacgcgg gagtgccgca gtacaatttc tactgtgttt attaggtggc 4 80 

aagtcatggg atgcatgtgt aaatagctac agaaatgcat tggcacaaaa tgcaggtgtc 540 

tattccttta atctgacatt gtcatacaac ccgataacca caacctgcaa accggacgat 600 

ttattaatta ctttagacag tattcccgtt tcacaattac cagccacagg taacaaagca 660 

acaataaata gtaaacaagg ggatattatt ctgcgttgta aaaatttatt aggtcaacaa 720 

aatcaaacat cacggaaaat gcaggtgtat ttatcaagtt ctgacttgtt aaccaacagc 780 

aacacaatac tgaaaggtgc ggaagataat ggcgtaggat ttattcttga aagtaatggt 840 

tcgccagtca cacttttaaa tatcactaac agcagtaaag gatatacaaa tttaaaggaa 900 

gttgcggcga agtcaaaact tacagataca acggtttcaa ttccgataac agccagttac 960 

tacgtctacg atacaaacaa agttaaatct ggcgcactgg aggcaaccgc attaatcaac 1020 

gtgaaatacg actaa 1035 

<210> 126 
<211> 2481 
<212> DNA 
<213> E. Coli 



<400> 126 

atgttgagaa tgaccccact tgcatcagca atcgtagcgt tattgctcgg cattgaagct 60 

tatgcagctg aagaaacctt tgatacccat tttatgatag gtggaatgaa agaccagcag 120 

gttgcaaata ttcgtcttga tgataatcaa cccttaccgg ggcagtatga catcgatatt 180 

tatgtcaata agcaatggcg cgggaaatat gagattattg ttaaagacaa cccgcaagaa 240 

acatgtttat caagagaagt tatcaagcgg ttaggcatta atagcgataa cttcgccagc 300 

ggtaagcaat gtttaacatt tgagcaactt gttcagggtg ggagctatac ctgggatatc 360 

ggggtttttc gtctcgattt cagtgtcccg caggcctggg tggaagaact ggaaagtggc 420 

tatgttccac cggaaaactg ggagcggggt attaatgcgt tttatacctc ttattatctg 480 

agtcagtatt acagcgacta taaagcgtcg ggtaataaca agagtacata tgtacgtttt 540 

aacagcgggt taaatttact ggggtggcaa ctgcattctg atgccagttt cagtaaaaca 600 

aataacaatc caggggtgtg gaaaagcaat accctgtatc tggaacgtgg atttgcccaa 660 

cttctcggca cgcttcgcgt gggtgatatg tacacatcaa gcgatatttt tgattctgtt 720 

cgcttcagag gtgtgcggtt gtttcgtgat atgcagatgt tgcctaactc gaaacaaaat 780 

tttacgccac gggtgcaggg gattgctcag agtaacgcgc tggtaactat tgaacagaat 840 

ggttttgtgg tttatcagaa agaggttcct cctggcccgt tcgcgattac agatttgcag 900 

ttggccggtg gtggagcaga tcttgatgtc agcgtgaaag aggcggacgg ctcggtaacc 960 
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acctatctgg 
gatttagcgg 
gcgggttatc 
aataattatt 
gtcgatgcca 
tatcaaattg 
rggcgttatt 
gataattatc 
tttggccgca 
tctgtgtcat 
tatcagttga 
gcttatgacg 
gattggggtg 
tttgatgatc 
gatcagttca 
ggggcgaatt 
agtacttatc 
gttaatctgg 
gatgcttatg 
gacggaatga 
gaagcagaat 
gttaattttg 
caatcattaa 
ggccagggaa 
attgataagc 
agaaattata 



tgccttatgc 
cgggtcgtag 
agtatggttt 
acgcgtttac 
ctaagtcgca 
cctacaacaa 
cgtcgcgtga 
gccgtgatga 
aaaatagctt 
taagtacgtt 
gttattccaa 
agaatcatca 
atgacgtttc 
aggggtttgc 
attatggtgt 
tgacctggaa 
gacaggctgg 
cgaaccgtct 
tcaatgggca 
caccttatcg 
tacgtggcaa 
ataccgatca 
cgtttggtta 
gtcagttatt 
aacaaggact 
tttgccagta 



agcggtgcca 
ccatattgaa 
taataattta 
tttgggggct 
tagtaaacaa 
atttgtgagc 
ttaccggaca 
aaacgatgtc 
ttccgccaat 
atggcgagat 
caacctgcga 
tgaagagaaa 
gacgcctcgt 
ctcaaataat 
caacctgagt 
cgcgccggtt 
agccagtgtt 
ttccgaaacg 
aaaatatcgc 
ggaaaatcac 
ccggaaaatt 
.gcgcaagcca 
tgaagtcaat 
tattcgcacc 
ttcatgcaca 



aatatgctgc 
ggggcgagca 
ttgacgctgt 
ggctggaata 
gacaacggcg 
caaacgtcga 
tttaacgatc 
tatgacattg 
atgagccagt 
tactgggggc 
cggataagct 
cgttttaata 
cggcaaatat 
acgggattat 
catcaacatc 
gcgacagtga 
tcagggggca 
tttgctgtga 
acaacaaacc 
ctgatgctgg 
gccgcccctt 
tggtttataa 
gatatccatg 
aatgaagtac 
atcaccttcg 



aacccggcgt 
aacaaagtga 
atggtggctc 
cacgcattgg 
atgtgtttga 
cgcgttttgg 
acgtttgggc 
ccgattatta 
cattgccaga 
gtagcggcag 
ataccctcgc 
tttttatatc 
atatgtctaa 
caggaacagt 
agggaaatga 
atggcagtta 
ttgtcgcctg 
tgaatgcgcc 
gtaatggagt 
atgtgtcgca 
atcgcggcgc 
aagcgttaag 
gtcataatat 
cgccatcggt 
gtaaagagat 



gtcgaaatat 
ttttgtccag 
gatggtcgcg 
tgccatttcc 
cgggcaaagt 
tctggcggcc 
aaacaataaa 
ccagaacgat 
aggttggggg 
tagtaaggat 
ggcaagccag 
gattcccttt 
ctcaacgacg 
agggagtcgg 
aacgacagct 
tagtcagtcg 
gtcgggtggc 
aggaattaaa 
ggtgatatac 
aagcgatagc 
ggttgtactg 
agcagatggg 
tggcgttgtc 
taatgtggca 
tgatgaaagt 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2481 



<210> 127 
<211> 720 
<212> DNA 
<213> E. Coli 



<400> 
atggccgcta 
tctttactca 
cgcattattt 
cgttcttcgc 
attcaggttc 
caagtaaaaa 
ctgaatgttc 
tttgcgatgc 
aaagcgacat 
gactcagcta 
gaaactatta 
aataactggc 



127 
tcccatggcg 
ttttttctat 
acccggcaga 
tgctgcaggc 
ctttcatgtt 
tcaaaattat 
tggatattcc 
aaaacagaat 
ttaaaaaatt 
attgggtgac 
tgattgcccc 
atctgaccat 



gccttttaat 
ggtccttcct 
aaataaagaa 
gtggattgat 
aacgccacca 
gccgaataaa 
accaaatagt 
taagttgttt 
gctggtaaat 
gatttcggat 
cttagaaagt 
tatcgatgac 



ttaagaggca 
gcacatgccg 
gtgatggtgc 
gatggcgata 
gtggcaaaaa 
ctgcccacta 
ccagagcaag 
taccggccag 
cgcagtggca 
gtcaaagcta 
cagagtgtta 
catggcaact 



ttaaaatgaa 
gaattgttat 
agttgatgaa 
cgtcattacc 
taggggcaaa 
ataaagaaag 
aaggtaagaa 
cgggtattgc 
atggtttggt 
ataatgtcaa 
atgtcaaaag 
atattagtga 



aggattatta 
ctacgggacg 
ccagggaaac 
accagaaaaa 
ttccgggcag 
cattttttat 
tgcactgaag 
tccggtaaat 
gataaaaaat 
agtcaattat 
taataatgca 
caaaatttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



<210> 128 
<211> 543 
<212> DNA 
<213> E. Coli 



<400> 128 

atgaaacgtt caattattgc tgccgctgtc ttttcttctt tttttatgag cgctggagta 60 

tttgctgcag acgttgatac cggaacatta actattaagg ggaatattgc agaatctccg 120 

tgtaaattcg aagcgggtgg tgattcagta agtattaata tgccgactgt accaaccagt 180 

gtctttgaag gtaaagctaa atattctacc tatgatgatg cagtcggtgt aaccagcagc 240 

atgttaaaaa ttagctgccc gaaagaagtt gctggtgtaa aactctcgtt gattaccaac 300 

gataaaataa ccggtaacga taaggcgata gccagtagca acgataccgt gggttactat 360 

ctctatttag gtgataacag cgatgtcctg gatgtttctg caccttttaa cattgagagt 420 

tataaaacag cggaaggtca atatgctatt ccgtttaaag caaaatacct gaaactgaca 480 

gataactcag tgcaatcagg tgatgtgtta tcttctctgg ttatgcgtgt ggcgcaggat 540 
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taa 



<210> 129 
<211> 339 
<212> DNA 
<213> E. Coli 



543 



<400> 129 

atgagttcag agcgagatct ggttaatttt cttggcgatt tttcaatgga tgtggccaaa 60 

gcagttatag ccggtggtgt tgcaaccgct attggaagtc tggcttcttt tgcctgtgtt 120 

agctttggct ttccagtaat tcttgtcgga ggagcaattt tactgacagg gatagtgtgt 180 

acagttgttt taaatgaaat cgatgctcaa tgccatttat cagaaaaatt aaaatatgca 240 

attagagatg gactaaaacg gcaacaggaa cttgataaat ggaaaaggga aaacatgact 300 

ccatttatgt atgttcttaa cactccaccc gtgatatga 339 

<210> 130 
<211> 582 
<212> DNA 
<213> E . Coli 



<400> 
atgactgact 
aagtttctcg 
atggggctgg 
acgtggattt 
attgctgtgg 
cgtctgctgg 
ttgctgaata 
gctgtcggtt 
gctgatgtcc 
tctctggcct 



130 
acctgttact 
gtctctgtcc 
caacaacgtt 
tgatcccact 
tcgtgcagtt 
ggattttttt 
tcaatctcgg 
tctcgctggt 
cggcaccttt 
ttatgggctt 



gtttgtcgga 
gtttatgggg 
tgtgatgacg 
taatctgatt 
caccgagatg 
gccgcttatc 
gcacaatttc 
gatggtgctc 
tcgcggtaat 
tagtggtttg 



actgtactgg 
gtttccaaaa 
ctggcgtcta 
tacctgcgca 
gtggtgcgca 
accaccaact 
ttgcagtcgg 
ttcgccgcca 
gccattgcgt 
gtgaagttgt 



tcaataactt 
aactggaaac 
tttgcgcctg 
ccctggcatt 
aaaccagccc 
gtgcagtgct 
cgctgtacgg 
tccgcgaacg 
taattaccgc 



tgtactggtc 
cgcgatgggc 
gcttatcgat 
tattctggtg 
ggtgctttac 
cggcgtggcg 
tttttccgcc 
ccttgctgtg 
aggtcttatg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
582 



<210> 131 
<211> 579 
<212> DNA 
<213> E. Coli 



<400> 131 

atgaatgcta tctggattgc cgttgccgcc gtgagcctgc tgggcctggc gtttggcgcc 60 

attctgggtt atgcctcccg ccgttttgcg gtggaagacg atccggtcgt tgagaaaatt 120 

gacgaaatct taccgcagag ccagtgtggt cagtgcggtt atcccggctg tcgcccctac 180 

gcggaagcca tcagctgtaa cggtgaaaaa atcaaccgtt gcgccccagg tggcgaagct 24 0 

gtgatgctaa aaattgccga gttgcttaat gtcgagccgc agccgctgga tggcgaagcg 300 

caagagataa cgcctgcgcg gatggtggcg gttattgatg aaaataactg tattggctgc 360 

actaaatgta ttcaggcgtg tccggtagac gccatcgttg gcgctacccg tgccatgcat 420 

acggtaatga gtgatctctg tacgggctgc aatttatgtg ttgatccgtg cccgacgcac 480 

tgcatctcgt tgcaaccggt cgcagaaaca cctgactcct ggaaatggga tctgaacacc 540 

attcccgtgc gtatcattcc cgtggaacac catgcttaa 579 

<210> 132 
<211> 2223 
<212> DNA 
<213> E . Coli 



<400> 132 

atgcttaagt tattctctgc attcagaaaa aataaaatct gggatttcaa cggcggcatc 60 

catccaccgg agatgaaaac ccagtccaac ggtacacccc tgcgccaggt acccctggcg 120 

cagcgttttg ttattccact gaaacagcat attggcgctg aaggtgagtt gtgcgttagc 180 

gtcggcgata aagtattgcg cggccagccg cttacccgtg gtcgcggcaa aatgctgcct 240 

gttcacgcgc ccacctcggg taccgttacg gctattgcgc cccactctac ggctcatcct 300 

tcagctttag ctgaattaag cgtgattatt gatgccgatg gtgaagactg ctggatcccg 360 
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cgcgacggct gggccgatta tcgcactcgc agtcgcgaag agttaatcga gcgcatacat 420 

cagtttggtg ttgccgggct gggcggtgca ggattcccga caggcgttaa attgcagggt 480 

ggcggagata agattgaaac gttgattatc aacgcggctg agtgcgagcc gtacattacc 540 

gccgatgacc gtttgatgca ggattgcgcg gctcaggtcg tagagggtat tcgcattctt 600 

gcgcatattc tgcagccacg cgaaattctt atcggcattg aagataacaa accgcaggcg 660 

atttccatgc tgcgcgcggt gctggcggac tctaacgata tttctctgcg ggtgattcca 720 

accaaatatc cttctggcgg tgctaaacaa ttaacctaca ttctgaccgg gaagcaggtt 780 

ccacatggcg ggcgttcatc cgatatcggc gtattaatgc aaaacgtcgg cactgcttat 840 

gcagtgaaac gtgccgttat tgatggcgag ccgattaccg agcgtgttgt aaccctgact 900 

ggcgaagcaa tcgctcgccc gggcaacgtc tgggcacggc tggggacgcc agtgcgtcat 960 

ttattgaatg atgccggatt ctgcccctct gccgatcaaa tggtgattat gggtggcccg 1020 

ctaatgggct ttaccttgcc atggctggat gtcccggtcg taaagattac caactgtctg 1080 

ttggctccct ctgccaatga acttggcgaa ccacaggaag aacaaagctg catccggtgt 114 0 

agcgcctgtg ctgacgcctg ccctgcggat cttttgccgc aacagttgta ctggttcagc 1200 

aaaggtcagc aacacgataa agctaccacg cataacattg ctgattgcat tgaatgtggg 1260 

gcttgcgcgt gggtttgccc gagcaatatt cccctggtgc aatatttccg tcaggaaaaa 1320 

gctgaaattg cggctattcg tcaggaagaa aagcgcgccg cagaagccaa agcgcgtttc 1380 

gaagcgcgcc aggctcgtct ggagcgcgaa aaagcggctc gccttgaacg acataagagc 1440 

gcagccgttc aacctgcagc caaagataaa gatgcgattg ctgccgctct ggcgcgggtg 1500 

aaagagaaac aggcccaggc tacacagcct attgtgatta aagcgggcga acgcccggat 1560 

aacagtgcaa ttattgcagc acgggaagcc cgtaaagcgc aagccagagc gaaacaggca 1620 

gaactgcagc aaactaacga cgcagcaacc gttgctgatc cacgtaaaac tgccgttgaa 1680 

gcagctatcg cccgcgccaa agcgcgcaag ctggaacagc aacaggctaa tgcggaacca 1740 

gaacaacagg tcgatccgcg caaagccgcc gtcgaagccg ctattgcccg tgccaaagcg 1800 

cgcaagctgg aacagcaaca ggctaatgcg gaaccagaag aacaggtcga tccgcgcaaa 1860 

gccgccgtcg aagccgctat tgcccgtgcc aaagcacgca agctggaaca gcaacaggct 1920 

aatgccgagc cagaacaaca ggtcgatccg cgcaaagccg ccgtcgaagc cgctattgcc 1980 

cgagccaaag cgcgcaaacg ggaacagcaa ccggctaatg cggagccaga agaacaggtt 204 0 

gatccgcgca aagctgccgt cgaagcggct attgcacgcg ccaaagcacg caagctggaa 2100 

cagcaacagg ctaatgcggt accagaagaa caggttgatc cgcgcaaagc ggcagttgcc 2160 

gcggctattg cccgcgctca ggccaaaaaa gccgcccagc agaaggttgt aaacgaggac 2220 

taa 2223 

<210> 133 
<211> 1059 
<212> DNA 
<213> E. Coli 

<400> 133 

atggtattca gaatagctag ctccccttat acccataacc agcgccagac atcgcgcatt 60 

atgctgttgg tgttgctcgc agccgtgcca ggaatcgcag cgcaactgtg gttttttggt 120 

tggggtactc tcgttcagat cctgttggca tcggttagtg ctctgttagc cgaagctctc 180 

gtactcaaac tacgcaagca gtcggtagcc gcaacgttga aagataactc agcattgctg 24 0 

acaggcttat tgctggcggt aagtattccc cccctcgcgc catggtggat ggtcgtgctg 300 

ggtacggtgt ttgcggtgat catcgctaaa cagttgtatg gcggtctggg acaaaacccg 360 

tttaatccgg caatgattgg ttatgtggtc ttactgatct ccttccccgt gcagatgacc 420 

agctggttac cgccacatga aattgcggtc aacatccctg gttttatcga cgccatccag 480 

gttattttta gcggtcatac cgccagtggt ggtgatatga acacactacg cttaggtatt 54 0 

gatggcatta gtcaggcgac accgctggat acatttaaaa cctctgtccg tgccggtcat 600 

tcggttgaac agattatgca atatccgatc tacagcggta ttctggcggg cgctggttgg 660 

caatgggtaa atctcgcctg gctggctggc ggcgtatggt tgctatggca gaaagcgatt 720 

cgctggcata ttcccctcag cttcttagta acgctggcgt tatgcgcaat gttgggctgg 780 

ttgttctcac cagaaacact ggcagcaccg caaattcatc tgctgtctgg agcgaccatg 840 

ctcggcgcat tctttatttt gactgacccg gttaccgctt ctacgaccaa tcgtggtcgt 900 

cttattttcg gcgcgcttgc gggcttatta gtctggttga tccgcagttt cggcggctat 960- 

cctgacggcg tggcttttgc cgtcctgctg gcgaacatca cggttcctct gatcgattac 1020 

tacacgcgtc cgcgcgtcta cggccatcgc aaagggtaa 1059 

<210> 134 
<211> 621 
<212> DNA 
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<213> E . Coli 



<400> 134 

atgctgaaaa ctatccgaaa acacggcatt acgttggcgc tatttgcagc gggttcaaca 60 

gggttaactg cggccatcaa ccagatgacc aaaacgacga ttgctgaaca ggccagtctg 120 

caacaaaagg cgttatttga tcaggtgctg ccagccgaac gctataacaa tgcgctggca 180 

cagagttgct atctggtaac tgcgccagag ttaggtaaag gtgagcatcg ggtttacatc 240 

gccaaacagg atgacaaacc ggtagccgcc gttctggaag caaccgcgcc agatggctat 300 

tccggtgcga ttcagctgct ggtgggagcc gattttaacg gcacggtact tggcacgcgc 360 

gtgacagagc accacgaaac gccagggctt ggcgataaaa tcgaactgcg cctttctgac 420 

tggatcaccc attttgcggg taaaaaaatc agtggtgcag atgatgcgca ctgggcggtg 480 

aagaaagatg gtggtgattt cgaccagttc accggcgcga cgattactcc ccgcgcggtg 540 

gttaatgcgg taaaacgcgc cggattgtac gctcagacgt taccggcaca actttctcaa 600 

cttcctgcct gtggagaata a 621 

<210> 135 
<211> 696 
<212> DNA 
<213> E. Coli 



<400> 135 



gtgagcgaaa 
cagttgctcg 
ttaggacttg 
cactggacgc 
agcgctgtac 
tttattccgc 
aaaaaaggtc 
gccatgttcg 
gcagatgcgc 
tcccctttcc 
gcaggaaaat 
cgtgcattgc 



ttaaagacgt 
gcctttgtcc 
cgactacgct 
cagccgagat 
agatgctgat 
tgattgtcac 
cggcgctttc 
tgctgggttc 
tgttaggtag 
tgctggcgat 
acctgattga 
caaacggtga 



tattgttcag 
tctgttggcg 
ggtactgacg 
ccgcattccc 
caacgcctac 
taactgtatc 
ggcactggac 
actacgcgaa 
ctgggcaaaa 
gctgccacca 
tgaaagaatg 
aacagggaat 



gggttgtgga 
gtcacgtcca 
ctgaccaacc 
atttacgtga 
gcctttggcc 
gttgtgggcc 
ggcttttcaa 
attatcggca 
gtattacgcg 
ggtgcattta 
aaaaagcgcc 
gtctga 



aaaacaactc 
ctgccactaa 
tgaccatttc 
tgatcatcgc 
tgtatcaatc 
gcgctgaagc 
ttggtatggg 
atggcacatt 
tggagatttt 
ttggcctggg 
gtgctgaagc 



tgcgctggtc 
cgctctgggt 
gacgctgcgt 
ctcggtggtc 
attagggatt 
cttcgccgcc 
cgcaacctgc 
gtttgacggt 
ccacaccgac 
actgatgctg 
agctgcagaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
696 



<210> 136 
<211> 636 
<212> DNA 
<213> E. Coli 



<400> 136 

atgaataaag caaaacgcct ggagatcctc actcgcctgc gtgagaacaa tcctcatccc 60 

accaccgagc ttaatttcag ttcgcctttt gaattgctga ttgccgtact gctttccgct 120 

caggcgaccg atgtcagtgt taataaggcg acggcgaaac tctacccggt ggcgaatacg 180 

cctgcagcga tgcttgaact gggcgttgaa ggggtgaaaa cctatatcaa aacgattggg 240 

ctttataaca gcaaagcaga aaatatcatc aaaacctgcc gtatcttgct ggagcagcat 300 

aatggcgagg ttccggaaga tcgtgctgcg cttgaagccc tgcccggcgt aggtcgtaaa 360 

acagccaacg tcgtattaaa cactgcattc ggctggccga ctattgctgt cgacacgcac 420 

attttccgcg tttgtaatcg tactcaattt gcgccgggga aaaacgtcga acaggtagaa 480 

gaaaagctac tgaaagtggt tccagcagag tttaaagtcg actgccacca ttggttgatc 540 

ctgcacgggc gttatacctg cattgcccgc aagccccgct gtggctcttg tattattgaa 600 

gatctttgtg aatacaaaga gaaagttgac atctga 636 

<210> 137 
<211> 504 
<212> DNA 
<213> E. Coli 



<400> 137 

atgaaaagac ttcacaagag gttcctgtta gctacgtttt gcgcgttatt cacagcaact 60 

ctccaggccg ccgatgtcac tatcactgtt aatggtcggg tagtcgctaa accctgcact 120 

attcaaacca aagaagctaa cgttaatctc ggggatcttt atacgcgcaa tctgcaacaa 180 
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cctggttctg catctggctg gcacaatatt actttgtcat taaccgattg tccggttgaa 240 

acaagtgcag tgacggcaat cgtgacaggt tcaactgaca atacgggtta ttacaaaaat 300 

gaaggtactg ccgaaaatat tcagatagag ctgagggatg accaggatgc tgcgttaaaa 360 

aatggcgata gcaaaacggt tattgttgat gagatcactc gtaatgcaca gtttccactt 420 

aaggcaagag ctatcacggt gaatggaaac gcaagccagg gaacgatcga ggcgctaatc 4 80 

aatgtgatct acacctggca ataa " 5Q4 

<210> 138 

<211> 531 

<212> DNA 

<213> E. Coli 



<400> 138 

atgaaataca ataacattat tttcctcggt ttatgtctgg ggttaaccac ctattctgct 60 

ttatccgcag atagcgttat taaaattagc gggcgcgtcc tcgattatgg ctgcacagtc 120 

tcatcggatt cgcttaattt taccgtagat ctccaaaaaa acagtgccag acaatttcca 180 

acgaccggta gcacaagtcc agccgtccct tttcagatta cgttaagtga atgcagcaaa 24 0 

gggacaacgg gggttcgggt tgcatttaac ggtattgagg atgcagaaaa taatactttg 300 

ttgaaactgg atgaaggaag caatacggcc tccggtttgg gtatagaaat attggacgca 360 

aatatgcgtc cggtgaaact gaatgatctt catgccggga tgcagtggat cccactggta 420 

ccagaacaga acaatatttt gccttactcc gctcgtctga agtcaactca gaagtccgtc 480 

aatccgggac tggtgagggc ttcggcaacc tttacccttg aatttcaata a 531 

<210> 139 
<211> 1149 
<212> DNA 
<213> E . Coli 

<400> 139 

atgagtggtt acaccgtcaa gcctcctacc ggagacacca atgagcagac acaatttatt 60 

gattatttta atctgttcta cagtaagcgt ggtcaggaac aaataagcat ctctcagcag 120 

cttggaaatt acggtacgac atttttcagt gccagtcgcc aaagttactg gaacacgtca 180 

cgcagcgacc agcaaatatc atttggatta aatgtgccgt ttggtgatat tacgacttcg 24 0 

ctgaattaca gctattccaa taatatatgg caaaacgatc gggatcattt actcgctttt 300 

acgcttaatg ttcccttcag tcattggatg cgtacagaca gtcagtcggc atttcgtaat 360 

tcaaacgcca gttacagtat gtcaaacgat ttgaaaggcg gcatgaccaa tctatcgggg 420 

gtttatggca ctctgctgcc ggataataac ctgaattata gcgttcaggt cggtaacacc 480 

cacggaggta atacatcgtc tggcaccagt ggttacagtt ctcttaatta tcgtggagct 54 0 

tatggtaata ctaatgtcgg ttacagtcgg agtggtgaca gcagccagat ttattacgga 600 

atgagtggtg ggattattgc tcatgctgat ggcatcacct ttggacagcc gctgggcgac 660 

acaatggttc tggttaaggc tcctggtgct gataatgtca aaatagagaa ccagaccgga 720 

attcataccg actggcgtgg ctatgccata ttaccatttg cgacagaata tagagaaaac 780 

cgtgttgctc ttaacgcgaa ttcccttgca gataatgttg aactggatga aaccgtggtc 840 

actgtcatcc caactcacgg tgctattgcc agagcaacat ttaatgcaca aatcggcggg 900 

aaagtattaa tgacgttgaa gtacggtaat aagagcgttc cattcggtgc aattgtcaca 960 

cacggagaga ataaaaatgg cagcattgtc gcggaaaatg gtcaggttta tctgactgga 1020 

cttccacagt cagggcaatt acaggtttca tggggcaaag ataaaaactc aaactgtatt 1080 

gtcgagtaca agcttcctga agtttctcct ggtaccttac tgaaccagca gacagcaatc 1140 

tgtcgctaa 1149 

<210> 140 
<211> 417 
<212> DNA 
<213> E . Coli 

<400> 140 

atgattgcga ttgccgacat cttgcaagca ggagaaaagc taactgctgt ggcacctttt 60 

ctggcgggta ttcagaacga ggaacaatac acccaggcgc tggaactggt agatcatctg 120 

ctgctcaacg atcctgaaaa ccccttgctg gatctggtgt gtgccaaaat aaccgcgtgg 180 

gaagaatcag cgcccgaatt tgcggaattt aatgccatgg ctcaagccat gcctggcggt 240 

atagccgtga ttcgtaccct tatggatcaa tatggtttaa ccctttccga tctgccggaa 300 
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attggcagta aatctatggt 
cacgctaaaa aattggcaac 

<210> 141 
<211> 315 
<212> DNA 
<213> E. Coli 

<400> 141 
atgcacctga taactcaaaa 
acggagttgg tggctctggg 
ttaaaagcag tattcccatc 
aatgttgggg gcaatgaatt 
tacatacgtg aagttatgac 
aaggggaaaa aatga 

<210> 142 
<211> 7152 
<212> DNA 
<213> E. Coli 

<400> 142 

ttgctatcag tatttacatt ttttcgctgt gctagaaagg gcgcatttat gttagctcgt 60 

tcagggaagg taagcatggc tacgaagaag agaagtggag aagaaataaa tgaccgacaa 120 

atattatgcg ggatgggaat taaactacgc cgcttaactg cgggtatctg tctgataact 180 

caacttgcgt tccctatggc tgcggcagca caaggtgtgg taaacgccgc aacccaacaa 240 

ccagttcctg cacaaattgc cattgcaaat gccaatacgg tgccctacac ccttggagcg 300 

ttggaatcgg cccaaagcgt tgccgaacgt ttcggtattt cggtggctga gttacgcaaa 360 

ctcaaccagt ttcgtacgtt tgctcgaagt tttgataatg tccgccaggg tgatgaactg 420 

gatgtcccgg cacaagttag tgaaaaaaaa ttaaccccgc cgccgggtaa tagcagtgac 480 

aacctcgagc aacagatagc cagtacttca cagcaaatcg ggtctctgct cgccgaagat 540 

atgaacagcg agcaagcggc aaatatggcg cgtggatggg cctcttctca ggcttcaggc 600 

gcaatgacag actggttaag ccgcttcggt accgcaagaa tcacgctggg cgtggatgaa 660 

gattttagcc tgaagaactc ccagttcgat tttctccatc cgtggtatga aacgcctgat 720 

aatctctttt tcagtcagca tactctccat cgtactgacg agcgtacgca gattaacaac 780 

ggcttaggtt ggcgtcattt cactcccaca tggatgtcgg gcatcaactt ctttttcgac 840 

cacgatctta gccgttacca ctcccgcgcc ggcattggcg cggagtactg gcgcgactat 900 

ctaaaattaa gcagtaacgg ctatttgcga ctgaccaact ggcgcagcgc acctgaactg 960 

gacaacgatt atgaagcacg cccggccaat ggctgggatg tacgcgcaga aagctggcta 1020 

cccgcctggc cgcaccttgg cggtaaactg gtctatgaac agtattatgg cgatgaagtg 1080 

gccctgttcg ataaagacga tcggcaaagt aatcctcatg ccataaccgc tggacttaac 1140 

tataccccct tcccgctgat gaccttcagc gcggagcaac gccagggtaa acagggcgaa 1200 

aatgacaccc gttttgccgt cgattttacc tggcaacctg gcagcgcaat gcagaaacag 1260 

cttgacccga atgaagtcgc tgcacggcgt agccttgcag gcagccgtta tgatctggtg 1320 

gatcgcaaca acaatatcgt tctggaatat cgcaaaaaag aactggttcg cctgaccctg 1380 

acagaccccg tgacagggaa gtcaggagaa gtgaaatcac tggtttcgtc gctacaaacc 1440 

aaatatgccc tgaaaggcta taacgtcgaa gccaccgcac tggaagctgc cggtggcaaa 1500 

gtggtcacaa cgggtaaaga tattctggtt accctgccgg cttaccggtt caccagtacg 1560 

ccagaaaccg ataacacctg gccgattgaa gtcaccgccg aagatgtcaa aggcaatttg 1620 

tcgaatcgtg aacagagcat ggtggtcgtt caggcaccta cgctaagcca gaaagattcc 1680 

tcggtatcgt taagtaccca aacattgaac gcggattccc attcaaccgc cacactgact 1740 

tttattgcgc atgatgcagc aggtaatcct gttgtcgggc tggtgctctc gacgcgtcac 1800 

gaaggtgttc aggacatcac cctttctgac tggaaagata atggtgacgg aagctatacc 1860 

cagatcctga ccacaggagc gatgtctggc acgctgacgc tgatgccaca gctgaatggt 1920 

gtggatgcgg ctaaagcccc cgccgtggtg aatatcattt ctgtttcgtc atcccgaact 1980 

cactcgtcaa ttaagattga taaggaccgt tatctctccg gcaatcctat cgaggtgacg 2040 

gtagaactga gagatgaaaa tgacaaacct gttaaggaac aaaaacagca actgaataac 2100 

gcagtcagca tcgacaacgt gaaaccagga gtcactacag actggaaaga aaccgcagat 2160 

ggcgtctata aggcgaccta taccgcctat accaaaggca gtggacttac tgcgaagcta 2220 

ttaatgcaaa actggaatga agatttgcat accgctggat ttatcatcga cgccaacccg 2280 

cagtcagcga aaattgcgac attatctgcc agcaataatg gtgtgctcgc caatgagaat 2340 



gtcacgcgtt ttgagcggga agaggaaatt aacgctggaa 360 
gcgattcggc atttctcccg ccttgtttat tgattaa 417 



agcattgaaa gatgctgcgg aaaaataccc gcaacataaa 60 
gaacacgatt gctaagggat atttcaaaaa acctgagtca 120 
tctggataac ttcaaatatc tggataagca ttatgttttc 180 
acgtgttgta gcaatggtct tttttgaatc gcaaaagtgc 240 
gcataaagaa tacgatttct ttaccgctgt tcatcgtact 300 
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gcagcaaaca ccgtctcggt caatgtcgct gatgaaggaa gcaacccaat caatgatcat 2400 

accgtcacgt ttgcggtatt aagcggatcg gcaacttcct tcaacaatca aaacaccgca 2460 

aaaacggatg ttaatggtct ggcgactttt gatctgaaaa gtagtaagca ggaagacaac 2520 

acggttgaag tcacccttga aaatggcgtg aaacaaacgt taatcgtcag ttttgtcggc 2580 

gactcgagta ctgcgcaggt tgatctgcag aagtcgaaaa atgaagtggt tgctgacggc 2640 

aatgacagcg tcacaatgac cgcgaccgtc cgggatgcaa aaggcaacct gctcaatgac 2700 

gtcatggtca ctttcaatgt taattcagca gaggcgaaac tgagccaaac cgaagtgaat 2760 

agccacgacg ggatcgccac agctacgctg accagtttga aaaatggtga ttatagggtt 2820 

acggcctctg tgagctctgg ttcccaggct aatcaacagg tgaattttat cggtgatcaa 2880 

agtactgctg ccctgaccct cagtgtgcct tcaggtgata tcaccgtcac caacacagct 2940 

ccgcaatata tgactgcaac cttgcaggat aaaaatggca acccactaaa agataaagaa 3000 

atcaccttct ctgtgccaaa cgacgtcgca agtaagttct cgattagcaa cggaggaaaa 3060 

ggcatgacgg atagtaacgg ggttgcaatc gcctccctga ccggcacgtt agcgggcacg 3120 

catatgatca tggctcgtct ggctaacagc aatgtcagcg atgcacagcc aatgacgttt 3180 

gtggcggata aagacagagc ggttgtcgtt ttgcaaacat cgaaagcgga aatcattggg 3240 

aatggcgtgg atgagacaac tctgacagca acagtgaaag atccgtcgaa tcatccggtg 3300 

gcggggataa cggtaaactt caccatgcca caggacgttg cggcaaactt tacccttgaa 3360 

aataacggta ttgccatcac tcaggccaat ggggaagcgc atgtcacgct gaaaggtaaa 3420 

aaagcgggca cgcatacggt taccgcaacg ctgggtaata acaataccag tgattcgcag 3480 

ccggtaacat ttgtggcgga caaagcctcg gctcaggttg tcctgcagat atcaaaagat 3540 

gagatcacag gtaatggcgt cgatagcgca acgctaactg caacggttaa agatcagttc 3600 

gacaatgagg tgaataatct tccggtaaca ttcagctcag cctcttcagg actcaccctg 3660 

accccgggag taagtaatac caacgagtct ggcatcgcgc aggccactct cgcaggcgtt 3720 

gcctttggtg agaagacggt tactgcatca ctggctaata atggtgccag cgacaacaaa 3780 

actgtgcatt ttattggcga cacagcggcg gcaaaaatta tcgagttggc gcctgtccca 384 0 

gacagcataa tcgccggtac cccgcagaac agctccggca gcgtcatcac cgccacagtc 3900 

gttgataata atggctttcc ggtgaaaggt gtgactgtga acttcaccag caacgcagcg 3960 

acagccgaaa tgacgaacgg cggtcaagcc gtgacgaacg aacagggtaa ggctaccgtc 4020 

acttatacca atacccgctc ctcgatagaa tcaggagcga gaccggatac cgttgaggcc 4080 

agtctggaaa atggtagctc cacgcttagc acatcaatta atgtcaacgc tgatgcgtct 4140 

acggcacatc tcaccttgct acaggcactt tttgatacag tctccgcagg cgagacaacc 4200 

agtctgtata ttgaggtgaa ggataattac ggcaacggtg tcccccagca ggaggtaacc 4260 

ctcagcgttt caccaagtga aggcgtgacc cccagtaata acgctatata tactaccaac 4320 

cacgacggca atttttacgc aagctttacc gctacaaaag ccggggttta tcaattgacg 4380 

gcaaccctcg aaaatggcga ttcgatgcaa caaacagtga cctatgtgcc gaacgtcgcg 4 440 

aatgctgaaa tcacgctggc agcctcgaag gatccggtga ttgccgacaa taacgatctc 4500 

acgacactaa cagcaacagt cgctgataca gagggcaatg cgatagccaa cactgaggta 4560 

acatttactc tgccggaaga tgtgaaggcg aacttcacgc tgagcgatgg cggtaaagtg 4 620 

attactgatg ctgaaggcaa agcgaaagtc acgctgaaag gtacaaaagc aggcgctcat 4 680 

actgttacag catcgatgac tggcggtaag agtgagcagt tggtggtgaa ctttattgcg 4740 

gatacgctca ctgcgcaggt taatcttaac gttaccgagg acaattttat cgctaataac 4800 

gtcgggatga ccaggctgca ggcaacagtg actgatggaa acggcaaccc gttagccaat 4860 

gaggcggtga cattcacgct accggcagat gtgagcgcaa gctttactct cggacaaggc 4 920 

ggttccgcca ttactgatat caacggcaag gctgaagtta cactgagcgg tacaaaatcc 4 980 

ggcacctacc ccgtgacagt tagcgtgaac aattatggtg tcagtgatac gaaacaggtg 5040 

actttgattg ccgatgctgg taccgcaaaa ctagcctcct taacctctgt atactcattc 5100 

gtcgtcagca cgaccgaggg cgcaaccatg acggcaagcg tcactgacgc taacggcaac 5160 

ccggtagaag gcataaaagt taatttccgc ggaacctccg tcacgctaag cagcaccagc 5220 

gttgaaacgg atgatcgggg tttcgctgaa attcttgtga caagcaccga ggtcggactg 5280 

aaaacagttt cagcctctct ggcagataaa cctactgaag tcatctcgcg attactgaat 5340 

gccagtgcag atgttaattc tgcgacgatt accagtctgg agataccgga aggtcaggta 5400 

atggtcgcac aagacgtagc agttaaagct cacgttaacg accagtttgg caacccggtt 54 60 

gcgcatcaac ccgtgacatt cagtgcagag ccatcctcgc aaatgatcat cagccagaat 5520 

acggtctcta ctaatacgca gggtgtagcc gaggtcacca tgacgcccga aagaaacggt 5580 

tcgtatatgg tgaaagcatc cctgccgaat ggagcctcac ttgagaaaca actggaggct - 5640 

attgatgaaa aactgacact cacggcgtcc agtccgctta tcggtgtcta tgcccctaca 5700 

ggcgctactc tgacggcaac gctaacctct gcaaatggca ctccagtgga gggtcaggtc 57 60 

atcaacttta gcgtaacgcc agaaggggcg acgttaagtg gcggaaaagt gagaactaac 5820 

tcttcaggtc aggctccagt cgttttgacc agcaataaag tcggtacata tacggtgact 5880 

gcatctttcc ataacggcgt aacaatacag acacagacaa ccgtgaaagt cactggcaac 5940 

tcaagcaccg cccatgttgc tagctttatc gctgatccat cgactatcgc cgccaccaac 6000 



-48- 



WO 00/44906 



PCT/USOO/02200 



actgatttaa gtaccttaaa ggcaacggtt gaggatggca gtggtaacct gatcgaaggt 6060 

ctcactgtgt acttcgcctt aaaaagcggc tctgccacat taacgtcatt aacagcggtg 6120 

accgatcaaa acggaatcgc gacaacaagc gtgaaaggag cgatgacagg tagcgtcacg 6180 

gtaagcgcag tcacgaccgc tggtggaatg caaacagtag atataacgct ggtggctggc 6240 

ccggcagaca cctcgcagtc cgtccttaag agcaatcggt catcactgaa aggggactat 6300 

accgatagtg ctgaattacg tcttgttctg cacgatatat caggcaatcc gatcaaagtt 6360 

tctgaaggga tggaatttgt gcaatcaggt actaacgtgc cctatataaa aattagcgca 6420 

attgattaca gtctaaatat caacggtgat tacaaagcca ctgttacagg aggcggagag 6480 

ggtatcgcaa cgctgatccc tgtattgaat ggtgttcatc aagctggtct gagtaccaca 6540 

atacaattca ctcgcgcaga agacaaaata atgagcggta cagtatcagt caatggtact 6600 

gacctaccga caactacatt cccttcgcag gggttcaccg gggcgtatta tcagttgaat 6660 

aatgacaact ttgccccagg aaaaacggcg gctgattatg agttttcaag ctctgcctcc 6720 

tgggtcgatg ttgatgctac cggtaaagtg acatttaaaa atgtcggcag caattcggaa 6780 

aggattacgg cgacgccaaa atcaggaggc cctagctatg tatacgaaat ccgtgtgaag 6840 

agttggtggg tgaacgccgg cgaggctttc atgatataca gccttgctga aaatttttgc 6900 

agcagcaatg gctacacgct ccccagagca aactatttaa accactgtag ttcccgaggc 6960 

atcgggtcac tgtacagtga atggggagat atggggcatt acacgactga cgctggtttt 7020 

caatcaaata tgtattggtc atctagtccc gcaaactcaa gcgaacaata cgtagtttcc 7080 

ctggcaacag gtgatcaaag cgtatttgaa aagcttgggt ttgcttatgc gacatgttat 7140 
aaaaacctgt ga 7152 
<210> 143 

<211> 186 

<212> DNA 

<213> E. Coli ■ - 



<400> 143 

atgagcaaag gcgcattata tgaatttaac aatccagatc aactgaaaat acctctccct 
cataaacaca tagcgtcaac attcaatgac ataatgagta aagatgttgg ttatgcatac 
gtatcattac tctatgcctg tcccttaaaa acccactcat taagactgaa tccattcagc 
aaatga 

<210> 144 
<211> 1197 
<212> DNA 
<213> E . Coli 



60 
120 
180 
186 



<400> 144 

atgcaggtgg ctgaacagcg cattcagcta gctgaagccc 
caggatggtc cgcagatcga cttttcggcg gatatggagc 
ggcttaatgg ggccgtttgc tctgaacgat ccggccgcag 
accaacggta cttttggctt aacggcgggc tggcatctcg 
gcggaggtta ctgcccgcct gggtacggtt aaagcacggg 
cgccaattgc tggctggcag cgtagcccgc ctgtactggg 
ttaaacacgg tcttgcagca aatagaaaaa gagcagaaca 
cagctatatc agaacgggat tacttcttca gttgaaggtg 
agcaaaaccc ggcagcagct caacgatgtc gcggggaaaa 
ttaagcgcac ttacaaataa ccagacaaag tcattgaagc 
aaagtggcaa gccagcttcc tgatgaactg gggtactcct 
ttgcaggcgg cgcactggta cgttgagtca tcgctaagca 
gcattttatc ctgacatcaa cctgatggcc ttcctgcaac 
gatctgttcc gtcattccgc gcagcaaatg ggcgttacgg 
ttcgatagtg gtcgtcttaa cgccaatctc gatatcgcaa 
atcgccagct acaacaaagc ggtggttgaa gcggtgaatg 
caggttcaga cactggcgga gaaaaaccag catcaggcgc 
cgtgtggtag gtcttgcgca ggcgcgcttt aacgcgggca 
agcgaagcca gaatccccgc gctgcgtgag cgggccaatg 
tggctggatg cctccattca actcactggt gcgttgggcg 

<210> 145 
<211> 291 
<212> DNA 



aggcgaaggc 


agttgccact 


60 


ggcaaaaaat 


gtcggcagaa 


120 


gtacgaccgg 


cccgtggtac 


180 


atatctgggg 


aaagaatcgg 


240 


cggcggaacg 


cgagcaaacc 


300 


agtggcaaac 


ccaggcggcg 


360 


ccattatcgc 


gaccgatcgc 


420 


tggaaaccga 


tattaatgcc 


480 


tgaaaattat 


tgaggcacgg 


540 


ttaaaccggt 


cgcgttgccg 


600 


tactggcccg 


gcgggcagat 


660 


ccattgatgc 


ggcaaaagcg 


720 


aggatgcgtt 


gcacttaagc 


780 


caggcctgac 


gctacccatt 


840 


aagccgaaag 


caacttgtct 


900 


acgtggcgcg 


ggcagccagt 


960 


aaattgagcg 


cgatgccttg 


1020 


tcattgctgg 


ttcccgcgtc 


1080 


gcctgttatt 


gcaagggcag 


1140 


gggggtacaa 


acgctga 


1197 
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<213> E. Coli 

<400> 145 
atgtattgcc acgcgaaact aaaaaatata 
ttcttacctg attattcccc catgaatcgt 
ccgctgttac tgatgctggc cgggtgtgcg 
cagcaaacgc ccgctgcaca agttgacacc 
cagacagcca atggtggctg gagtatcacg 

<210> 146 

<211> 948 

<212> DNA 

<213> E. Coli 



tcgcaacaca cggtaatctc cgcgcacctt 60 

gattcctttt atccagccat cgcctgtttt 120 

cctatgcatg aaacccgcca ggcgttaagc 180 

gcattaccca cggcgctgaa aatggttggc 240 

ataatcaact cacttcctta a 291 



<400> 146 



atgcgtgtgt 
accgaagtta 
ctgccggtaa 
tctggtacgc 
gccgcccgtg 
aaaacggtta 
cagggtgcaa 
cgtctgggct 
ggcgctacgg 
attgactggc 
ggtgaaatct 
gtgatgattg 
gaaccgcgaa 
aagcagggcg 
cgtaaagaat 
cctgatattg 



tactggcacc 
acgactacga 
aagtctttca 
tggtgcgcgt 
cggtggagtt 
acggtagcgg 
aagcgatgcg 
gggacagcgg 
agctggtggt 
aggcgattgg 
gggactggca 
gtcgcggggc 
tgccgtggcc 
ataccgggtt 
acgatgaagc 
caagggctat 



gatggaggga 
tctgtgcatc 
tcgcatttgc 
gcagttgtta 
aggttcctgg 
cggcggggcg 
tgaagctgta 
tgagaagaaa 
gcatgggcgg 
cgatattcgc 
gagcgcgcaa 
gctcaatatt 
ggaggtggtt 
atatcacgtt 
aacggaatta 
tcaggcaatt 



gtgcttgact 
accgagtttg 
cctgagctac 
ggtcagttcc 
ggcgtggatc 
acgttactca 
ccggcgcatt 
tttgaaatcg 
acgaaagagc 
cagcggctga 
caatgcatgg 
cccaacctga 
gctttgctgc 
gcgcggatta 
tttcagcatg 
gatatcgaga 



ctctggtgcg 
tccgcgtggt 
aaaacgccag 
cacaatggct 
tcaattgcgg 
aagatcctga 
tgcccgtcag 
ccgatgcggt 
agggttaccg 
atattccggt 
cgatcagcgg 
gccgggtggt 
aaaaatatac 
aacagtggtt 
ttcgggtgtt 
aactctaa 



tgaattgctg 
ggatcaactg 
ccggacacca 
ggcagagaac 
ctgcccgtcg 
actcatctac 
cgtgaaagtg 
tcaacaggct 
cgcggagcat 
gattgccaac 
ctgcgacgca 
aaaatataac 
ccgtctggaa 
gagttatttg 
gaataattcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
948 



<210> 147 
<211> 891 
<212> DNA 
<213> E. Coli 

<400> 147 

atgacaatat cgacaacttc cacgccgcat gatgcggtat ttaaatcttt tttacgccat 60 

ccagacaccg cgcgggattt tattgatatt catcttcccg cgccgctgcg caaactgtgt 120 

gatttaacga cgcttaaact ggaaccaaac agttttattg atgaagacct gcggcaatat 180 

tattccgacc tcttgtggtc tgtgaaaacg caggagggag tgggttatat ttatgtagtg 240 

atagagcacc aaagtaagcc ggaagaatta atggcttttc gcatgatgcg ttattccatt 300 

gcggcaatgc aaaaccatct tgatgcgggc tataaagagc ttccattggt gctcccgatg 360 

ctgttttatc atggttgcag aagtccttat ccttattcac tctgctggct tgatgaattt 420 

gccgagcctg ctatagcccg caaaatatat tcatcggctt ttccgttggt ggatattacc 480 

gtggtgccgg atgacgagat tatgcaacac cgcaaaatgg cgctgttgga gttaattcag 540 

aaacatattc gtcagcgcga tctgttggga ttagtcgacc aaattgtttc gctgctagtt 600 

acagggaaca ctaatgacag acagctaaaa gccctgttta attacgtatt acaaacaggg 660 

gatgcccagc gttttcgtgc atttattggt gagatagcgg aacgcgcacc acaagaaaag 720 

gagaaactga tgaccattgc tgacagatta cgtgaagaag gcgcaatgca gggcaaacac 780 

gaagaagccc tgcgtattgc tcaggagatg ctggatagag gtttagacag agagttagtt 840 

atgatggtga cccgactttc accagacgat cttatcgcgc aaagccacta a 8 91 

<210> 148 . 
<211> 1668 
<212> DNA 
<213> E. Coli 



<400> 148 

gtggctcaat tcgtttatac catgcatcgt gtcggcaaag ttgttccgcc gaaacgtcat 



60 
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attttgaaaa acatctctct gagtttcttc cctggggcaa aaattggtgt cctgggtctg 120 

aatggcgcgg gtaagtccac cctgctgcgc attatggcgg gcattgataa agacatcgaa 180 

ggtgaagcgc gtccgcagcc agacatcaag attggttatc tgccgcagga accgcagctg 240 

aacccggaac acaccgtgcg tgagtccatt gaagaagcgg tttcagaagt ggttaacgcc 300 

ctgaaacgcc tggatgaagt gtatgcgctg tacgccgatc cggatgccga ttttgacaag 360 

ctggccgctg aacaaggccg tctggaagag atcattcagg ctcacgacgg tcataatctg 420 

aacgtacagc tggagcgtgc ggcggatgcg ctacgtctgc cggactggga cgcgaaaatc 480 

gctaacctct ccggtggtga acgtcgtcgc gtagcgttgt gccgcctgct gctggaaaaa 540 

ccagacatgc tgctgctcga cgaaccgacc aaccacctgg atgccgaatc cgtggcctgg 600 

ctggaacgct tcctgcacga cttcgaaggc accgttgtgg cgattaccca cgaccgttac 660 

ttcctcgata acgttgcagg ctggatcctc gaacttgacc gcggtgaagg tattccgtgg 720 

gaaggtaact actcctcctg gctggagcag aaagatcagc gcctggcgca ggaagcttca 780 

caagaagcgg cgcgtcgtaa gtcgattgag aaagagctgg aatgggtacg tcaaggtact 840 

aaaggccgtc agtcgaaagg taaagcacgt ctggcgcgct ttgaagaact gaacagcacc 900 

gaatatcaga aacgtaacga aaccaacgaa ctgtttattc cacctggacc gcgtctgggc 960 

gataaagtgc tggaagtcag caacctgcgt aaatcctatg gcgatcgtct gctgattgat 1020 

gacctgagct tctcgatccc gaaaggagcg atcgtcggga tcatcggtcc gaacggtgcg 1080 

ggtaaatcga ccctgttccg tatgatctct ggtcaggaac agccggacag cggcaccatc 114 0 

actttgggtg aaacggtgaa actggcgtcg gttgatcagt tccgtgactc aatggataac 1200 

agcaaaaccg tttgggaaga agtttccggc gggctggata tcatgaagat cggcaacacc 1260 

gagatgccaa gccgcgccta cgttggccgc tttaacttta aaggggttga tcagggtaaa 1320 

cgcgttggtg aactctccgg tggtgagcgc ggtcgtctgc atctggcgaa gctgctgcag 1380 

gttggcggca acatgctgct gctcgacgaa ccaaccaacg acctggatat cgaaaccctg 1440 

cgcgcgctgg aaaacgccct gctggagttc ccgggctgtg cgatggttat ctcgcacgac 1500 

cgttggttcc tcgaccgtat cgccacgcac attctggatt accaggatga aggtaaagtt 1560 

gagttcttcg aaggtaactt taccgagtac gaagagtaca agaaacgcac gctgggcgca 1620 

gacgcgctgg agccgaagcg tatcaagtac aagcgtattg cgaagtaa 1668 



<210> 149 
<211> 522 
<212> DNA 
<213> E. Coli 

<400> 149 

atgtcaaagc caaaataccc ttttgaaaag cgccttgaag tcgtgaatca ctacttcaca 60 

actgatgatg gttacaggat catctcggca cgttttggtg tcccccgaac ccaggtcagg 120 

acatgggttg ccctctatga aaaacatgga gaaaaaggtt taattcccaa acctaaaggc 180 

gttagtgctg atccagagtt gcgtattaag gtcgtgaaag ctgtgatcga gcagcacatg 240 

tcccttaatc aggctgctgc tcactttatg cttgctggta gtggttctgt agccaggtgg 300 

ctgaaggtct atgaagagcg cggagaagct ggtttacgcg cgctcaagat tggcaccaaa 360 

agaaacattg caatatcagt tgatccagaa aaagcggcat cagcattgga gctgtcaaaa 420 

gaccgacgca ttgaggatct tgaaaggcaa gttcgatttc ttgaaacgcg gcttatgtat 480 

ctaaaaaagc tgaaagcctt agctcatccc acgaaaaagt ga 522 

<210> 150 
<211> 852 
<212> DNA 
<213> £. Coli 

<400> 150 

gtgaaagtac tcaacgagct aaggcagttt tatcctcttg atgagcttct cagggctgcg 60 

gagataccgc gcagtacgtt ttattatcat ctaaaggctc tcagcaagcc tgacaagtat 120 

gcggacgtta aaaagcgtat tagtgagatt tatcacgaga atagaggccg atacggatac 180 

cgtagggtaa cgctgtctct tcatcgagaa gggaaacaga ttaaccataa agctgttcag 240 

cgcctgatgg gaaccctctc acttaaagca gcgattaagg tcaagcgata ccgctcttac 300 

agaggagagg tagggcaaac cgcccctaat gttctccaaa gagatttcaa ggctacgcgg 360 

ccaaacgaga agtgggttac cgatgttact gaatttgcag tcaatgggcg caagctgtat 420 

ttgtctccag taatagatct cttcaacaac gaagttattt cttacagcct ttcggaaaga 4 80 

ccagtgatga acatggttga gaatatgctc gatcaggcat tcaaaaagct taatcctcac 540 

gagcatcctg ttctgcactc tgaccaggga tggcagtatc gtatgagaag atatcaaaat 600 
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atccttaaag aacatggtat taaacaaagc atgtccagaa aaggcaattg tctggataat 660 

gctgtggtgg agtgtttctt tggaacctta aagtcggagt gtttttatct tgatgagttc 720 

agtaatataa gcgaactgaa ggatgctgtt acggaatata ttgaatacta caacagcaga 780 

agaattagcc tgaaattaaa aggtctgact ccaattgaat atcggaatca gacctatatg 840 

cctcgtgttt aa 852 

<210> 151 

<211> 117 

<212> DNA 

<213> E. Coli 



<400> 151 

atgaaagttc gtgcttccgt caagaaatta tgccgtaact gcaaaatcgt taagcgtgat 60 
ggtgtcatcc gtgtgatttg cagtgccgag ccgaagcata aacagcgcca aggctga 117 

<210> 152 

<211> 1332 

<212> DNA 

<213> E. Coli 



<400> 152 

atggctaaac aaccgggatt agattttcaa agtgccaaag gtggcttagg cgagctgaaa 60 

cgcagactgc tgtttgttat cggtgcgctg attgtgttcc gtattggctc ttttattccg 120 

atccctggta ttgatgccgc tgtacttgcc aaactgcttg agcaacagcg aggcaccatc 180 

attgagatgt ttaacatgtt ctctggtggt gctctcagcc gtgcttctat ctttgctctg 240 

gggatcatgc cgtatatttc ggcgtcgatc attatccagc tgctgacggt ggttcaccca 300 

acgttggcag aaattaagaa agaaggggag tctggtcgtc gtaagatcag ccagtacacc 360 

cgctacggta ctctggtgct ggcaatattc cagtcgatcg gtattgctac cggtctgccg 420 

aatatgcctg gtatgcaagg cctggtgatt aacccgggct ttgcattcta cttcaccgct 480 

gttgtaagtc tggtcacagg aaccatgttc ctgatgtggt tgggcgaaca gattactgaa 540 

cgaggtatcg gcaacggtat ttcaatcatt atcttcgccg gtattgtcgc gggactcccg 600 

ccagccattg cccatactat cgagcaagcg cgtcaaggcg acctgcactt cctcgtgttg 660 

ctgttggttg cagtattagt atttgcagtg acgttctttg ttgtatttgt tgagcgtggt 720 

caacgccgca ttgtggtaaa ctacgcgaaa cgtcagcaag gtcgtcgtgt ctatgctgca 780 

cagagcacac atttaccgct gaaagtgaat atggcggggg taatcccggc aatcttcgct 840 

tccagtatta ttctgttccc ggcgaccatc gcgtcatggt tcgggggcgg tactggttgg 900 

aactggctga caacaatttc gctgtatttg cagcctgggc aaccgcttta tgtgttactc 960 

tatgcgtctg caatcatctt cttctgtttc ttctacacgg cgttggtttt caacccgcgt 1020 

gaaacagcag ataacctgaa gaagtccggt gcatttgtac caggaattcg tccgggagag 1080 

caaacggcga agtatatcga taaagtaatg acccgcctga ccctggttgg tgcgctgtat 1140 

attaccttta tctgcctgat cccggagttc atgcgtgatg caatgaaagt accgttctac 1200 

ttcggtggga cctcactgct tatcgttgtt gtcgtgatta tggactttat ggctcaagtg 1260 

caaactctga tgatgtccag tcagtatgag tctgcattga agaaggcgaa cctgaaaggc 1320 

tacggccgat aa ~ 1332 

<210> 153 
<211> 435 
<212> DNA 
<213> E. Coli 



<400> 153 

atgcgtttaa atactctgtc tccggccgaa ggctccaaaa aggcgggtaa acgcctgggt 60 

cgtggtatcg gttctggcct cggtaaaacc ggtggtcgtg gtcacaaagg tcagaagtct 120 

cgttctggcg gtggcgtacg tcgcggtttc gagggtggtc agatgcctct gtaccgtcgt 180 
ctgccgaaat. tcggcttcac ttctcgtaaa gcagcgatta cagccgaaat tcgtctgtct • 240 

gacctggcta aagtagaagg cggtgtagta gacctgaaca cgctgaaagc ggctaacatt 300 
atcggtatcc agatcgagtt cgcgaaagtg atcctggctg gcgaagtaac gactccggta • 360 

actgttcgtg gcctgcgtgt tactaaaggc gctcgtgctg ctatcgaagc tgctggcggt 420 

aaaatcgagg aataa ^ 435 

<210> 154 
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<211> 180 
<2I2> DNA 
<213> E. Coli 



<400> 154 



atggcaaaga ctattaaaat 
aaggcaacgc tgcttggcct 
actcctgcta ttcgcggtat 



tactcaaacc 
gggtctgcgt 
gatcaacgcg 



cgcagtgcaa 
cgtattggtc 
gtttccttca 



tcggtcgtct 
acaccgtaga 
tggttaaagt 



gccgaaacac 
gcgcgaggat 
tgaggagtaa 



60 
120 
180 



<210> 155 

<211> 504 

<212> DNA 

<213> E. Coli 

<400> 155 

atggctcaca tcgaaaaaca agctggcgaa ctgcaggaaa agctgatcgc ggtaaaccgc 60 

gtatctaaaa ccgttaaagg tggtcgtatt ttctccttca cagctctgac tgtagttggc 120 

gatggtaacg gtcgcgttgg ttttggttac ggtaaagcgc gtgaagttcc agcagcgatc 180 

cagaaagcga tggaaaaagc ccgtcgcaat atgattaacg tcgcgctgaa taacggcact 240 

ctgcaacacc ctgttaaagg tgttcacacg ggttctcgcg tattcatgca gccggcttcc 300 

gaaggtaccg gtatcatcgc cggtggtgca atgcgcgccg ttctggaagt cgctggggtt 360 

cataacgttc tggctaaagc ctatggttcc accaacccga tcaacgtggt tcgtgcaact 420 

attgatggcc tggaaaatat gaattctcca gaaatggtcg ctgccaagcg tggtaaatcc 480 

gttgaagaaa ttctggggaa ataa 504 

<210> 156 

<211> 354 

<212> DNA 

<213> E. Coli 

<400> 156 

atggataaga aatctgctcg tatccgtcgt gcgacccgcg cacgccgcaa gctccaggag 60 

ctgggcgcaa ctcgcctggt ggtacatcgt accccgcgtc acatttacgc acaggtaatt 120 

gcaccgaacg gttctgaagt tctggtagct gcttctactg tagaaaaagc tatcgctgaa 180 

caactgaagt acaccggtaa caaagacgcg gctgcagctg tgggtaaagc tgtcgctgaa 240 

cgcgctctgg aaaaaggcat caaagatgta tcctttgacc gttccgggtt ccaatatcat 300 

ggtcgtgtcc aggcactggc agatgctgcc cgtgaagctg gccttcagtt ctaa 354 

<210> 157 
<211> 534 
<212> DNA 
<213> E. Coli 

<400> 157 

atgtctcgtg ttgctaaagc accggtcgtt gttcctgccg gcgttgacgt aaaaatcaac 60 

ggtcaggtta ttacgatcaa aggtaaaaac ggcgagctga ctcgtactct caacgatgct 120 

gttgaagtta aacatgcaga taataccctg accttcggtc cgcgtgatgg ttacgcagac 180 

ggttgggcac aggctggtac cgcgcgtgcc ctgctgaact caatggttat cggtgttacc 240 

gaaggcttca ctaagaagct gcagctggtt ggtgtaggtt accgtgcagc ggttaaaggc 300 

aatgtgatta acctgtctct gggtttctct catcctgttg accatcagct gcctgcgggt 360 

atcactgctg aatgtccgac tcagactgaa atcgtgctga aaggcgctga taagcaggtg 420 

atcggccagg ttgcagcgga tctgcgcgcc taccgtcgtc ctgagcctta taaaggcaag 480 

ggtgttcgtt acgccgacga agtcgtgcgt accaaagagg ctaagaagaa gtaa 534 

<210> 158 
<211> 393 
<212> DNA 
<213> E. Coli 

<400> 158 

atgagcatgc aagatccgat cgcggatatg ctgacccgta tccgtaacgg tcaggccgcg 60 
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aacaaagctg cggtcaccat gccttcctcc aagctgaaag tggcaatcgc caacgtgctg 120 

aaggaagaag gttttattga agattttaaa gttgaaggcg acaccaagcc tgaactggaa 180 

cttactctga agtatttcca gggcaaagct gttgtagaaa gcattcagcg tgtcagccgc 240 

ccaggtctgc gcatctataa acgtaaagat gagctgccga aagttatggc gggtctgggt 300 

atcgcagttg tttctacctc taaaggtgtt atgactgatc gtgcagcgcg ccaggctggt 360 

cttggtggcg aaattatctg ctacgtagcc taa 393 



<210> 159 

<211> 306 

<212> DNA 

<213> E. Coli 

<400> 159 

atggctaagc aatcaatgaa agcacgcgaa gtaaaacgcg tagctttagc tgataaatac 60 

ttcgcgaaac gcgctgaact gaaagcgatc atctctgatg tgaacgcttc cgacgaagat 120 

cgttggaacg ctgttctcaa gctgcagact ctgccgcgtg attccagccc gtctcgtcag 180 

cgtaaccgct gccgtcaaac aggtcgtccg catggtttcc tgcggaagtt cgggttgagc 240 

cgtattaagg tccgtgaagc cgctatgcgc ggtgaaatcc cgggtctgaa aaaggctagc 300 

tggtaa 306 

<210> 160 
<211> 540 
<212> DNA 
<213> E. Coli 

<400> 160 

atggcgaaac tgcatgatta ctacaaagac gaagtagtta aaaaactcat gactgagttt 60 

aactacaatt ctgtcatgca agtccctcgg gtcgagaaga tcaccctgaa catgggtgtt 120 

ggtgaagcga tcgctgacaa aaaactgctg gataacgcag cagcagacct ggcagcaatc 180 

tccggtcaaa aaccgctgat caccaaagca cgcaaatctg ttgcaggctt caaaatccgt 24 0 

cagggctatc cgatcggctg taaagtaact ctgcgtggcg aacgcatgtg ggagttcttt 300 

gagcgcctga tcactattgc tgtacctcgt atccgtgact tccgtggcct gtccgctaag 360 

tctttcgacg gtcgtggtaa ctacagcatg ggtgtccgtg agcagatcat cttcccagaa 420 

atcgactacg ataaagtcga ccgcgttcgt ggtttggata ttaccattac cactactgcg 480 

aaatctgacg aagaaggccg cgctctgctg gctgcctttg acttcccgtt ccgcaagtaa 540 

<210> 161 
<211> 315 
<212> DNA 
<213> E . Coli 

<400> 161 

atggcagcga aaatccgtcg tgatgacgaa gttatcgtgt taaccggtaa agataaaggt 60 

aaacgcggta aagttaagaa tgtcctgtct tccggcaagg tcattgttga aggtatcaac 120 

ctggttaaga aacatcagaa gccggttccg gccctgaacc aaccgggtgg catcgttgaa 180 

aaagaagccg ctattcaggt ttccaacgta gcaatcttca atgcggcaac cggcaaggct 240 

gaccgtgtag gctttagatt cgaagacggt aaaaaagtcc gtttcttcaa gtctaacagc 300 

gaaactatca agtaa 315 

<210> 162 
<211> 372 
<212> DNA 
<213> E. Coli 

<400> 162 
atgatccaag aacagactat 
tgtatcaagg ttctgggtgg 
atcaccatca aagaagcaat 
gtagtggtgc gcaccaagaa 
ggtaatgctt gtgttcttct 
gggccggtaa ctcgtgagct 



gctgaacgtc gccgacaact ccggtgcacg tcgcgtaatg 60 

ctcgcaccgt cgctacgcag gcgtaggcga catcatcaag 120 

tccgcgtggt aaggtcaaaa aaggtgatgt gctgaaggcg 180 

gggtgttcgt cgcccggacg gttctgtcat tcgcttcgat 240 

gaacaacaac agcgagcagc ctatcggtac gcgtattttt 300 

tcgtagtgag aagttcatga aaattatctc tctggcacca 360 
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gaagtactct aa 

<210> 163 
<211> 567 
<212> DNA 
<213> E. Coli 



<400> 163 

atgtttaaag gacaaaaaac attggccgca ctggccgtat ctctgctgtt cactgcacct 60 

gtttatgctg ctgatgaagg ttctggcgaa attcacttta agggggaggt tattgaagca 120 

ccttgtgaaa ttcatccaga agatattgat aaaaacatag atcttggaca agtcacgaca 180 

acccatataa accgggagca tcatagcaat aaagtggccg tcgacattcg cttgatcaac 240 

tgtgatctgc ctgcttctga caacggtagc ggaatgccgg tatccaaagt tggcgtaacc 300 

ttcgatagca cggctaagac aactggtgct acgcctttgt tgagcaacac cagtgcaggc 360 

gaagcaactg gggtcggtgt acgactgatg gacaaaaatg acggtaacat cgtattaggt 420 

tcagccgcgc cagatcttga cctggatgca agctcatcag aacagacgct gaactttttc 480 

gcctggatgg aacaaattga taatgcagtc gatgtcacgg caggtgaagt aaccgctaac 540 

gcaacctacg tgctggatta taaataa 567 

<210> 164 
<211> 1284 
<212> DNA 
<213> E . Coli 



<400> 164 

atggctgata caaaagcaaa actcaccctc aacggggata cagctgttga actggatgtg 60 

ctgaaaggca cgctgggtca agatgttatt gatatccgta ctctcggttc aaaaggtgtg 120 

ttcacctttg acccaggctt cacttcaacc gcatcctgcg aatctaaaat tacttttatt 180 

gatggtgatg aaggtatttt gctgcaccgc ggtttcccga tcgatcagct ggcgaccgat 240 

tctaactacc tggaagtttg ttacatcctg ctgaatggtg aaaaaccgac tcaggaacag 300 

tatgacgaat ttaaaactac ggtgacccgt cataccatga tccacgagca gattacccgt 360 

ctgttccatg ctttccgtcg cgactcgcat ccaatggcag tcatgtgtgg tattaccggc 420 

gcgctggcgg cgttctatca cgactcgctg gatgttaaca atcctcgtca ccgtgaaatt 480 

gccgcgttcc gcctgctgtc gaaaatgccg accatggccg cgatgtgtta caagtattcc 540 

attggtcagc catttgttta cccgcgcaac gatctctcct acgccggtaa cttcctgaat 600 

atgatgttct ccacgccgtg cgaaccgtat gaagttaatc cgattctgga acgtgctatg 660 

gaccgtattc tgatcctgca cgctgaccat gaacagaacg cctctacctc caccgtgcgt 720 

accgctggct cttcgggtgc gaacccgttt gcctgtatcg cagcaggtat tgcttcactg 780 

tggggacctg cgcacggcgg tgctaacgaa gcggcgctga aaatgctgga agaaatcagc 840 

tccgttaaac acattccgga atttgttcgt cgtgcgaaag acaaaaatga ttctttccgc 900 

ctgatgggct tcggtcaccg cgtgtacaaa aattacgacc cgcgcgccac cgtaatgcgt 960 

gaaacctgcc atgaagtgct gaaagagctg ggcacgaagg atgacctgct ggaagtggct 1020 

atggagctgg aaaacatcgc gctgaacgac ccgtacttta tcgagaagaa actgtacccg 1080 

aacgtcgatt tctactctgg tatcatcctg aaagcgatgg gtattccgtc ttccatgttc 1140 

accgtcattt tcgcaatggc acgtaccgtt ggctggatcg cccactggag cgaaatgcac 1200 

agtgacggta tgaagattgc ccgtccgcgt cagctgtata caggatatga aaaacgcgac 1260 

tttaaaagcg atatcaagcg ttaa 1284 

<210> 165 
<211> 1434 
<212> DNA 
<213> E. Coli 



<400> 165 

atgaaagtaa cgctgccaga gtttgaacgt gcaggagtga tggtggttgg tgatgtgatg 60 

ctggatcgtt actggtacgg ccccaccagt cgtatctcgc cggaagcgcc ggtgcccgtg 120 

gttaaagtga ataccatcga agaacgtccg ggcggcgcgg ctaacgtggc gatgaatatc 180 

gcttctctcg gtgctaatgc acgcctggtc gggttgacgg gcattgacga tgcagcgcgc 240 

gcgctgagta aatctctggc cgacgtcaac gtcaaatgcg acttcgtttc tgtaccgacg 300 

catccgacca ttaccaaatt acgggtactt tcccgcaacc aacagctgat ccgtctggat 360 

tttgaagaag gtttcgaagg tgttgatccg cagccgctgc acgagcggat taatcaggcg 420 
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ctgagttcga 
cagcagatga 
accgattttg 
gctgttgtcg 
gccgattacg 
caaccgggta 
ggtgcgggcg 
gaagaagcct 
tccacggttt 
ggcgtgatga 
aaagtggtga 
gcaaatgccc 
aaacggctga 
ggcgcactgg 
atcgccggga 
gccgggagta 
ggttgctcga 



ttggcgcgct 
tccaactggc 
agcgctaccg 
gtaaatgtaa 
aactctcggc 
aagcgccgct 
acacggtgat 
gcttctttgc 
cgccgatcga 
ccgaagagga 
tgaccaacgg 
gcaagctggg 
aaggggattc 
aagcggtcga 
tcttgccaga 
aagaagtctg 
cgaccaacat 



ggtgctttct 
gcgtaaagcg 
cggcgctacg 
gaccgaagaa 
tctgttagtg 
gcatatgcca 
tggcgtcctg 
caatgcggcg 
gctggaaaat 
actgaagctg 
tgtctttgac 
tgaccgcttg 
ccgcccggta 
ctgggtagtg 
tctgctggtg 
ggccaacggt 
catcaagaag 



gactacgcca 
ggtgttccgg 
ctgttaacgc 
gagattgttg 
acccgttccg 
acccaagcgc 
gcggcaacgc 
gctggcgtgg 
gctgtacgtg 
gccgtagcgg 
atcctgcacg 
attgttgccg 
aacccactcg 
tcgtttgaag 
aaaggcggcg 
ggcgaagtgt 
atccaacagg 



aaggtgcgct 
tgctgattga 
cgaatctctc 
agcgcggcat 
aacagggtat 
aggaagtgta 
tggcagcggg 
tggtcggcaa 
gacgtgcaga 
cagcgcgtaa 
ccgggcacgt 
tcaacagcga 
aacagcgtat 
aggacacgcc 
actataaacc 
tggtgctcaa 
ataaaaaagg 



ggcaagcgta 
tccaaaaggt 
ggaatttgaa 
gaaactgatt 
gtcgctgctg 
tgacgttacc 
taattcgctg 
actgggaacc 
tacaggcttt 
acgtggtgaa 
ctcttatctg 
tgcctccacc 
gattgtgctg 
gcagcgcttg 
agaagagatt 
ctttgaagac 
ctaa 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1434 



<210> 166 
<211> 2841 
<212> DNA 
<213> E. Coli 



<400> 166 

atgaagccgc tctcttcacc gttacagcag tactggcaga ccgttgttga gcggctgcca 60 

gagcctttag ccgaggaatc acttagcgca caggcgaagt cagtacttac ttttagtgat 120 

tttgtgcagg acagcgtgat tgcgcatcca gagtggctga cggaactgga aagccaaccg 180 

ccgcaggccg acgaatggca gcattacgcg gcatggttgc aggaggcgct ctgtaatgtg 24 0 

agtgacgaag ccgggttaat gcgcgagctg cggctattcc ggcggcgcat tatggtgcgc 300 

atcgcctggg cgcaaacgct ggcactggtt actgaagaga gcatattgca gcagctcagc 360 

tatctggcgg agacgctgat tgttgcggcg cgtgactggc tgtatgacgc ctgctgccgc 420 

gagtggggaa cgccgtgcaa tgcgcagggc gaagcgcaac cgctgctgat tttaggcatg 480 

ggtaagctgg gcggtgggga gctgaatttc tcctctgata tcgatctgat ttttgcctgg 540 

ccggaacatg gttgtacgca gggtggacgc cgggaactgg ataacgcgca gttttttacc 600 

cgcatggggc agcggctgat taaagtgctg gatcaaccaa cgcaggatgg cttcgtctat 660 

cgcgtggata tgcggctgcg tccgtttggc gaaagtggcc cgctggtgct gagctttgcc 720 

gcgttggaag attattacca ggagcagggg cgcgactggg agcgttacgc gatggtcaag 780 

gcgcggatta tgggcgatag cgaaggcgtc tatgctaacg agttgcgtgc gatgctgcgc 840 

ccgtttgttt tccgtcgtta catcgatttc agcgtgattc agtcgctgcg caacatgaaa 900 

gggatgattg cccgtgaagt gcgtcgacgt ggtttgaccg acaatatcaa actcggcgca 960 

ggcggcattc gcgaaattga atttatcgtt caggtgttcc agctcattcg cggcggacgc 1020 

gaaccgtcgc tgcaatcgcg ctctttactg ccaacgctca gcgccattgc cgagctgcat 1080 

ctgctttctg aaaacgatgc tgaacaattg cgagtggcgt atctgttcct gcggcgtctg 1140 

gaaaacctgc tgcaaagcat taacgacgaa caaacccaga cgcttccttc tgatgagctt 1200 

aatcgtgcgc ggctggcgtg ggcgatggac tttgctgact ggccgcaact gaccggggcg 1260 

ctgaccgcac atatgaccaa tgtgcgccgg gtgtttaatg aattgattgg cgacgatgaa 1320 

agtgaaactc aggaagagtc gctgtcggaa cagtggcgtg agctgtggca ggatgcgttg 1380 

caggaagatg acactacgcc agtgctggcg catcttagcg aggatgatcg caaacaggtg 1440 

ctaacgctga ttgccgattt ccgcaaagag ctggataagc gcaccatcgg gccgcgagga 1500 

cgtcaggtgc tcgaccatct gatgccgcat ctgctaagtg atgtctgtgc gcgtgaagac 1560 

gctgccgtta cgctgtcgcg cattaccgcc ttgctggtgg ggattgttac ccgcaccacc 1620 

tatttagaat tgctcagtga attccccgcg gcgcttaaac atttgatttc tctgtgtgcc 1680 

gcgtcgccga tgattgccag ccagctggcg cgttatccat tattgctgga tgaattgctc 1740 

gatccaaaca ccctttacca gccgacggcg accgatgcct accgcgatga gttgcgccag 1800 
tatttgctgc gcgtgccgga agatgacgaa gagcaacagc ttgaggcgct gcgtcagttc ' 1860 

aaacaggcgc agctgttacg catcgccgca gcggatatcg ccggtacgct accggtgatg 1920 

aaagtgagcg atcacttaac ctggctggcg gaagccatga tagatgccgt cgttcagcag 1980 

gcgtgggttc aaatggttgc ccgctacggt aagccgaatc acctgaacga acgcgaaggg 2040 

cgtggttttg cggtggtcgg ctacggcaag ctgggcggct gggagttagg ctacagttcc 2100 

gatcttgacc ttatcttcct ccatgattgc ccaatggatg cgatgactga cggtgagcgg 2160 

gaaatcgacg ggcggcagtt ttatctgcgt ctggcgcaac gcattatgca tctgttcagt 2220 
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acgcgtacct cttccggcat tttgtatgaa gtggatgctc gactgcgtcc gtccggggcg 2280 

gcgggaatgc tggtgacatc cgcagaagca tttgccgatt atcagaaaaa cgaggcctgg 2340 

acgtgggaac atcaggcgct ggtgcgtgcg cgtgtagtgt acggcgatcc gcagctcacc 24 00 

gcgcactttg acgcagtgcg tcgcgagatt atgacgctgc cgcgtgaagg taaaactctg 24 60 

caaacggaag tgcgggaaat gcgcgagaaa atgcgcgctc atctcggcaa taaacatcgc 2520 

gatcgctttg atatcaaagc tgatgaaggg ggaattaccg atatcgaatt tattacccaa 2580 

tatctggtgt tgcgctacgc tcatgaaaaa ccgaagttaa cgcgctggtc agacaacgrg 2640 

cgtattctgg aactactggc gcaaaacgac attatggaag agcaggaagc gatggcgcrg 2700 

acccgtgctt acactacgct tcgcgatgaa cttcatcatc tggcattaca ggaattgccg 2760 

ggccatgtgt cggaggattg cttcaccgca gagcgtgaac tggtgcgggc aagctggcag 2820 

aagtggctgg tggaagaatg a 2841 

<210> 167 
<211> 1302 
<212> DNA 
<213> E. Coli 



<400> 167 

atggctcagg aaatcgaatt aaagtttatt gttaatcaca gtgccgttga ggcgttgcgt 60 

gaccatctca atacgctggg cggcgagcac catgaccccg tgcagttgct gaatatttac 120 

tacgaaacgc cggataactg gctgcgtggg cacgatatgg gcttacgtat tcgtggcgaa 180 

aacggtcgct atgagatgac catgaaagtt gcaggaagag tgacaggcgg cttacatcag 240 

cgcccggaat ataacgtggc gttgagcgaa ccgacgctcg acctggcgca gttaccgacg 300 

gaagtctggc cgaacggcga attgcccgcc gatctcgcct cccgcgtgca gccgctgttc 360 

agcaccgatt tttatcgcga aaaatggctg gtggcggtcg atggtagcca aattgaaatc 420 

gccctcgacc agggggaagt gaaagcgggt gaatttgctg aacctatctg tgagctggaa 480 

ctggaactgc ttagcggcga cacgcgcgcg gtgctgaaac tggcgaacca actggtatcg 540 

caaaccggat tacgccaggg cagcctgagc aaagcggcgc gtggctatca tctggcgcag 600 

ggcaatccgg cgcgtgaaat caaaccgacc accattttgc atgttgcggc aaaagccgat 660 

gtggaacagg ggctggaagc ggcgctcgag ctggcgttag cgcaatggca gtatcatgaa 720 

gaactgtggg tacgcggcaa cgatgcggcg aaagaacagg tgctggcagc cattagcctg 780 

gtccgtcata cgctgatgct gttcggtggt attgtgccgc gtaaagcgag cactcactta 840 

cgtgatctgc tgactcaatg cgaggcgacc attgcttctg cggtgtctgc cgtgacggcg 900 

gtctactcta ccgaaacggc aatggcgaag ctggcgttga ccgaatggtt ggtaagcaaa 960 

gcatggcagc catttttaga tgccaaagcg cagggcaaaa tcagcgactc cttcaaacgc 1020 

tttgccgata tccatctttc ccgccatgcc gctgaactga aaagcgtttt ctgccagccg 1080 

ttaggcgatc gctaccgtga ccagttgcca cgcctgacgc gtgatattga ctcaatactg 1140 

ttgctggcgg gttactatga tcctgtcgtc gcgcaagcct ggctggagaa ctggcagggg 1200 

ctgcatcacg ctattgcgac cgggcaacgc atcgaaattg aacatttccg taatgaggca 1260 

aacaatcagg aaccgttctg gttgcacagc ggaaaacgtt aa 1302 



<210> 168 
<211> 213 
<212> DNA 
<213> E. Coli 



<400> 168 

atgtccggta aaatgactgg tatcgtaaaa tggttcaacg ctgacaaagg cttcggcttc 60 

atcactcctg acgatggctc taaagatgtg ttcgtacact tctctgctat ccagaacgat 120 

ggttacaaat ctctggacga aggtcagaaa gtgtccttca ccatcgaaag cggcgctaaa 180 

ggcccggcag ctggtaacgt aaccagcctg taa 213 

<210> 169 
<211> 1572 
<212> DNA 
<213> E. Coli 



<400> 169 

atgagggaca ttgtggaccc tgtattctct atcggtatct catcattatg ggatgagctg 60 

cgacatatgc cagcaggcgg cgtctggtgg tttaacgtcg atcgccatga agatgctatc 120 

agtctggcga atcaaacaat tgcatcccag gctgaaaccg cacacgtcgc ggtcattagc 180 
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atggacagcg 
aaattatttt 
tgttctattg 
aacattcctg 
aaccattgtt 
ttgcttgagg 
catttgctgg 
agcgttcagc 
ccacgcagcg 
ctatcggaac 
gctcaggcgg 
cgcagcattc 
aataccgcta 
gttattccgt 
cagaagttta 
ctcaaactgc 
aataaccctc 
ggtatccgcg 
accattggcg 
accgcgttga 
tggtttgaag 
caatggggca 
gatggtcgcc 
cgctcatcat 



atccggcgaa 
caatgctaaa 
atccccataa 
ccgagcggct 
cgcttttggt 
aataccgttc 
atattgcctt 
aacaaaatgg 
acgaaaaacg 
actggcaact 
cgacggtggt 
ataccctgcg 
gcctgcgcgc 
ggaatgcgcc 
gtcgctatgt 
gtggtttcca 
tattacctgc 
ttgaacaagc 
gtaatcggct 
atcatatttt 
atgatcaaat 
tgccgctgcc 
actggcgacg 
ga 



aatctttcaa 
tcatgaaaaa 
ttaccttttt 
tcgctcatgg 
aattaatccc 
actttttggt 
ctggtgcaac 
tatctggaca 
cattctgagt 
gttcaacaat 
cttttcttta 
tcgccagcgc 
caccgatgaa 
actctcccgt 
gccggaagat 
gaagtgggat 
ccacggtaaa 
cctgacgctg 
ggtgctgttt 
cccattgcct 
cagtgccgag 
tttaacgcaa 
aataccagaa 



ttagatgatt 
ggtctatact 
attcttgttt 
ttggataaaa 
ggaaataata 
cttgccagtt 
gaaaaagggg 
ttagttcaaa 
aatgttgctg 
aacgaagtcc 
cagcaaaatg 
ggtagtgcga 
cgtttgttat 
tgtctgacga 
atcactacct 
gtgttctgta 
ggcgttctgg 
tgtcgcccta 
ctctcattct 
actggcgaca 
ctggtgcaga 
agttctaaac 
cccatgcgac 



ctcaagggcc 
atttgacccg 
gcgcaaataa 
tgaataaatg 
acgataaaca 
tgcgttttca 
tcagcgcccg 
gcgaagaggc 
tactggaagg 
tgttcaatga 
cgcaaatcga 
tgaaaatcct 
tggcctgcgg 
tgatcgaaag 
tgctgtcaat 
atgccgtcaa 
ttgccctacg 
accgtaccgg 
gtcggattaa 
ttttctcaaa 
tgcgcttgct 
cggtcatcaa 
tgttagatga 



ggaaaaaata 
tgatttgcag 
cgcatggcaa 
gagcaggtta 
attttcattg 
gggtgaccaa 
tcagcagctt 
ggagatccaa 
tgcgccgccg 
agcccgtacc 
gccactggcc 
cgtgcgggaa 
tgcaaatatg 
cgtgcaaggg 
gacccagccg 
caacatgatg 
tccggtaccg 
cgatatcatg 
cgatctggat 
ccgtatggtc 
tgccccagaa 
tgccgagcac 
tgctgtggag 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1572 



<210> 170 
<211> 189 
<212> DNA 
<213> E. Coli 



<400> 170 

atgaccatca gcgatatcat tgaaattatt gtcgtttgcg cactgatatt tttcccgctg 60 

ggctatctgg cgcggcactc tttgcgacgc attcgcgaca ccttacgttt gttctttgct 120 

aaacctcgtt atgttaaacc ggccgggacg ttacgccgca cggaaaaagc cagggcaacc 180 

aaaaaatga ^ ' ^gg 

<210> 171 

<211> 1680 

<212> DNA 

<213> E. Coli 



<400> 171 

atgactcaat ttacgcaaaa taccgccatg ccttcttccc tctggcaata ctggcgcggc 60 

ctttccggct ggaacttcta ttttctggtt aagttcggcc tgttgtgggc gggatatctt 120 

aacttccatc cgctcctcaa tttggtgttt gccgcgtttc tgctgatgcc ccttccgcgc 180 

tacagcctgc atcgcttgcg ccactggatt gccctgccga tcggctttgc tttgttctgg 240 

catgacacct ggttgcctgg cccggaaagc ataatgagcc agggttcgca ggtggcgggg 300 

ttcagtaccg attatttaat cgaccttgtc acacgcttta ttaactggca gatgattggg 360 

gccatttttg ttttattagt ggcctggtta ttcctgtcac aatggattcg cattaccgtt 420 

tttgtggttg ccatactgct atggctgaac gtacttaccc tggcgggacc aagtttctcc 480 

ttgtggccag ccggacaacc gacgaccact gtaacaacga cgggtggtaa cgcagcggca 540 

accgttgcgg cgacgggtgg cgcaccggta gtgggtgata tgcccgcaca aactgcaccg 600 

ccaacaacgg cgaaccttaa cgcctggctg aataatttct ataacgcgga ggcgaaacgt 660 

aaatcgacct tcccgtcttc gctgcccgct gatgctcagc catttgaact actggtgatt 720 

aacatctgtt cgctttcctg gtcggatata gaagccgccg ggttgatgtc gcatccactg 780 

tggtcgcatt tcgatattga gttcaagaac tttaactccg ccacctecta cagtggcccg 840 

gcggcgatcc gtttactgcg cgccagctgc gggcagactt cgcacactaa tctgtatcaa 900 

ccggcaaata acgactgcta tctgtttgat aacctttcga aactgggctt tacccagcac 960 

ctgatgatgg gacataacgg ccagttcggc ggttttttga aagaagttcg cgaaaatggc 1020 

ggcatgcaga gcgaattgat ggatcaaaca aatctgccgg ttattttgct gggctttgat 1080 

ggttcgccgg tttatgacga taccgctgtg cttaaccgct ggctggacgt taccgaaaaa 1140 
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gataaaaaca gccgtagtgc cacgttctac aacacgcttc cactgcatga cggcaaccat 1200 

tatccggggg tcagcaaaac agcggattac aaagcgcggg cgcagaaatt ctttgatgaa 1260 

ctggacgcct tctttactga acttgagaaa tcgggtcgta aagtgatggt ggtcgtggtg 1320 

ccggaacacg gcggcgcgct gaagggcgac agaatgcagg tatctggcct acgtgatatc 1380 

cctagcccgt ctatcaccga cgtccccgtt ggggtgaaat tcttcggcat gaaggcaccg 1440 

catcaggggg caccgattgt catcgaacaa ccgagcagct tcctggctat ctccgatctg 1500 

gtggttcgcg ttctcgatgg caagattttc accgaagaca atgttgactg gaaaaaactc 1560 

accagtgggt tgccacaaac agcaccggtc tccgagaact caaatgcagt agttattcaa 1620 

taccaggata aaccgtacgt tcgcctgaac ggcggcgact gggtgcctta cccgcagtaa 1680 

<210> 172 
<211> 384 
<212> DNA 
<213> E. Coli 

<400> 172 

atggaaggtt caagaatgaa ataccgcatc gctttagctg tttctctctt tgctcttagt 60 

gccggtagtt atgccactac cctgtgtcag gaaaaggagc aaaatatcct taaggagatc 120 

agctatgccg aaaaacacca aaaccagaat cgtattgacg gtctgaataa agccctgagt 180 

gaagtccggg ccaactgttc agatagccag ctgcgtgccg atcatcagaa gaaaatcgca 240 

aagcagaaag atgaggtggc ggaacgccag caagatttag ccgaggcgaa gcaaaaaggc 300 

gatgccgata agattgccaa acgcgaacgg aaactggcag aagcgcagga agagctgaaa 360 

aagctggaag cgcgcgacta ctaa 384 

<210> 173 

<211> 306 

<212> DNA 

<213> E . Coli 

<400> 173 

atgtcgaaag aacacactac ggaacatctg cgtgctgagt tgaaatccct ttccgatacg 60 

ctggaagagg tgcttagctc atctggcgag aagtcgaaag aagagttgag taagattcgt 120 

agcaaagcgg agcaggcact gaaacagagc cgttatcgcc tgggtgaaac cggtgatgcc 180 

attgccaaac aaacccgtgt cgcggcggcg cgtgccgatg agtatgtgcg cgaaaatccg 24 0 

tggacgggcg tgggcattgg cgctgcaatc ggtgtagtgc tcggcgttct gctgtcgcgt 300 

cgttaa " 306 

<210> 174 
<211> 405 
<212> DNA 
<213> E. Coli 

<400> 174 

atggcggaca ctcatcacgc acaagggccc ggtaaaagcg ttctgggcat cgggcagcga 60 

attgtttcta tcatggttga aatggtagag acacgtctgc ggctggcggt ggtggagctg 120 

gaagaggaaa aagcgaatct ctttcaactt ttactgatgc tgggcctgac gatgcttttc 180 

gctgcatttg gtcttatgag cctgatggtg ctaattattt gggcggttga cccgcaatat 240 

cgcctgaatg cgatgattgc caccaccgtg gtgttgctgc tactggcact gattggcggt 300 

atctggacgc tacgtaaatc gcgtaagtct acgttgctgc gccatacacg ccatgagtta 360 

gcaaacgatc ggcagctgct cgaggaggag tcccgtgagc agtaa 4 05 

<210> 175 
<211> 300 
<212> DNA 
<213> E. Coli 

<400> 175 

gtgagcagta aagtcgaacg tgaacgacgt aaggcgcaac tgcttagcca gatccagcaa 60 

caacggctgg atctttccgc cagtcgtcgt gaatggctgg agacaacagg cgcttacgat 120 

cgtcgctgga atatgctgct aagtctgcgc tcctgggcgc tggttggcag tagcgtgatg 180 

gcgatctgga cgattcgcca tcctaatatg ctggtccgct gggccagacg cggttttggc 24 0 
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gtatggagcg cctggcgtct ggttaaaacg accctcaagc agcaacagct tcgcggttaa 



300 



<210> 176 
<211> 483 
<212> DNA 
<213> E. Coli 



<400> 
atgattctct 
agcagcctga 
gtagcgcgca 
tacgcgggta 
gtgattctgc 
acagccctgt 
gctgaaggcg 
ctgctggcaa 
taa 



176 
ccatcgacag 
gtggcgcagt 
ttttaatgcc 
cccaacaata 
ttgagtttgg 
ttactgcggg 
tcaactcgct 
ttaccggtcc 



caacgacgct 
ggagagtatg 
gattctgttt 
tatggaagca 
tggtggtctg 
ctttacgctg 
gatgttcatg 
gggcgcgtat 



aataccgcgc 
atgaaaaaat 
attaccgctg 
atgggcgtcc 
gcaatcctgt 
ctgacggcat 
aaaaacctga 
agcatcgacc 



cattgcacaa 
tagaagatgt 
gctggggaaa 
cgggttttat 
tcggtttcct 
ttttatttca 
caatttctgg 
gcctgctgaa 



aaaaacaatc 
tggtgtactg 
aattactggc 
gctgccactg 
gactcgcacc 
cagcaacttt 
cggattcctg 
taaaaagtgg 



60 
120 
180 
240 
300 
360 
420 
480 
483 



<210> 177 
<211> 891 
<212> DNA 
<213> E. Coli 



<400> 177 



atgatcaaga 
gcccactacc 
ccagaaataa 
ctggggccac 
catatgcata 
gcgcatttcg 
aaaatttatt 
cgggttcttg 
ttaatgttgt 
ggcgatgtag 
acggtttatc 
gaagcatgtc 
tctgaagggc 
cctctggatg 
agaaatcgct 



agacaacgga 
agtggctggt 
cggataacca 
tcgataacga 
actgtggtcg 
acgcctttca 
tgctgacgat 
atgaatcctt 
tggatattga 
tattacgcac 
gctacggggg 
gtgcaggtgt 
atatcaacat 
tggtcattgg 
gcatgtttat 



aattgatgcc 
gagtatgttt 
ttcttatgga 
tgaattacct 
ggaattaatg 
ggaggggttg 
ccgtagcaat 
tgatcatcag 
ccgatttaaa 
cctggcaact 
cgaagaattt 
cagaatttgc 
taccgtgaca 
aagagcggac 
tgacgaacaa 



atcttgttaa 
cacagcgtgg 
ctgtgccagt 
tacgttcggc 
ctggctattg 
ctttctttta 
atggatgttt 
ttacgcaacg 
ttggttaatg 
tacttagcca 
atcattattg 
cagttagtcg 
gcaggtgtga 
cgggcaatgt 
aatgtgatta 



atctcaataa 
tcgcgagaga 
ttggtcggtg 
taatggattc 
ttgaaaatca 
ctgcggcatt 
tgacgggatt 
ctgagcctct 
atacctacgg 
gttggacgcg 
tcaaagcggc 
ataaccatgc 
gtcgcgcatt 
atgagggtaa 
accgagttta 



ggctatcgat 
tgccagtaag 
gattgatcat 
tgcccatcaa 
ctggcaggac 
aaccgattac 
gccgggtcgt 
gaatctttat 
gcatttaatc 
tgattacgaa 
taatgatgaa 
catcacacat 
tcctgaagag 
gcaaaccgga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
891 



<210> 178 
<211> 612 
<212> DNA 
<213> E . Coli 



<400 
atgcgccttc 
tataccccta 
gatatgttgg 
cttatgtata 
gtcgcattca 
tctttcgtgg 
ttactaccag 
gatttcctta 
aaattagaac 
ctcgtcaccg 
tttaatcgtt 



> 178 
gtgttgtgcc 
cagcgagagc 
atgtatttac 
cgattatcat 
atgcttatac 
tcgccatttc 
agcaacgagt 
aaacaggaaa 
ggaagggcat 
aaaaatactc 
ag 



cggttttatt 
ttgtgttaac 
tccattgttg 
ttttggtctc 
tgaaatacct 
cttctcaaga 
attgcttcgt 
tcttattatc 
cattcaacat 
ccattttatg 



tcaccacctc 
atttcaatac 
aaactttttg 
actctctggc 
tggctctttc 
ttgcgagcac 
ttatctgaga 
acttctcctt 
cagagtgata 
aagttattct 



cgggcttcgg 
ccttacagtt 
ctaacgagcc 
tgataccgaa 
agattatcgt 
atattcaaaa 
aagaaatcgc 
gccgtaaccc 
gcgcaaactg 
ggaacagcag 



tggtctcggc 
gagagttatt 
actcgaaaga 
agagtttact 
ttttgccttt 
gcattattca 
tgtatttaaa 
ggttatgaaa 
ttcttattat 
gagtagacgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
612 



<210> 179 
<211> 177 
<212> DNA 
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<213> E. Coli 



<400> 179 



gtgcttctcc aaccatcggc gcgcaccagt 
cgtcatggca gtgaacgttc catcctggtt 
gttgccgagg tcgatttttt gcattttgcg 



ttcggtttta 
ggggaacacg 
aatctcacat 



aatgttttgc 
ccgcacacca 
cttgttgcta 



ttttggtata 
gggattcgtt 
cgtatag 



60 
120 
177 



<210> 180 
<211> 4281 
<212> DNA 
<213> E. Coli 

<400> 180 

atgagcggaa aaccagcggc gcgtcaggga gatatgactc agtatggcgg tcccattgtc 60 

cagggttcgg caggtgtaag aattggcgcg cccaccggcg tggcgtgctc ggtgtgtccg 120 

ggcgggatga cttcgggcaa cccggtaaat ccgctgctgg gggcgaaggt gctgcccggc 180 

gagacggacc ttgcgctgcc cggcccgctg ccgttcattc tctcccgcac ctacagcagc 240 

taccggacga agacgcctgc accggtgggc gttttcggcc ccggctggaa agcgccttct 300 

gatatccgct tacagctacg tgatgacgga ctgatactca acgacaacgg cgggcggagc 360 

attcactttg agccgctgct gccgggggag gcggtgtaca gccgcagtga gtcaatgtgg 420 

ctggtgcgcg gtggtaaggc agcacagccg gacggccata cgctggcgcg gctgtggggg 4 80 

gcgctgccgc cggatatccg gttaagcccg catctttacc tggcgaccaa cagcgcacag 540 

gggccgtggt ggatactggg gtggtctgag cgggtgccgg gtgctgagga cgtactgcca 600 

gcgccgctgc cgccgtaccg ggtgcttacc gggatggcgg accgcttcgg gcggacgctg 660 

acgtaccggc gtgaggccgc cggtgacctg gccggggaaa tcaccggcgt gacggacggt 720 

gccgggcggg agttccgtct ggtgctgacc acgcaggcgc agcgtgcgga agaggcccgc 780 

acctcttcgc tatcttcttc tgacagttcc cgccctctct cagcctcagc gttccccgac 840 

acactgcccg gtaccgaata cggccccgac aggggtatcc gcctttcggc ggtgtggctg 900 

atgcacgacc cggcataccc ggagagcctg cccgctgcgc cactggtgcg gtacacgtat 960 

acggaagccg gtgaactgct ggcggtatat gaccgcagca atacgcaggt gcgcgctttc 1020 

acgtatgacg cgcagcaccc gggccggatg gtggcgcacc gttacgcggg aaggccggag 1080 

atgcgctacc gctacgacga tacggggcgg gtggtggagc aactgaaccc ggcagggtta 1140 

agctaccgct atctttatga gcaggaccgc atcaccgtca ccgacagcct gaaccggcgt 1200 

gaggtgctgc atacagaagg cggggccggg ctgaaacggg tggtgaaaaa agaactggcg 1260 

gacggcagcg tcacgcgcag cgggtatgac gcggcaggaa ggctcacggc gcagacggac 1320 

gcggcgggac ggaggacaga gtacggtctg aatgtggtgt ccggcgatat cacggacatc 1380 

accacaccgg acgggcggga gacgaaattt tactataacg acgggaacca gctgacggcg 1440 

gtggtgtccc cggacgggct ggagagccgc cgggaatatg atgaaccggg caggctggta 1500 

tcggagacat cgcgcagcgg ggagacagta cgctaccgct acgatgacgc gcacagtgag 1560 

ttaccggcga cgacaacgga tgcgacgggc agcacccggc agatgacctg gagccgctac 1620 

gggcagttgc tggcgttcac cgactgctcg ggctaccaga cccgttatga atacgaccgc 1680 

ttcggccaga tgacggcggt ccaccgcgag gaaggcatca gcctttaccg ccgctatgac 1740 

aaccgtggcc ggttaacctc ggtgaaagac gcacagggcc gtgaaacgcg gtatgaatac 1800 

aacgccgcag gcgacctgac tgccgttatc accccggacg gcaaccggag cgagacacag 1860 

tacgatgcgt ggggaaaggc ggtcagcacc acgcagggcg ggctgacgcg cagtatggag 1920 

tacgatgctg ccggacgtgt catcagcctg accaacgaga acggcagcca cagcgtcttc 1980 

agttacgatg cgctggaccg gctggtacag cagggcggct ttgacgggcg gacgcaacgt 2040 

tatcattatg acctgaccgg aaaactcaca cagagtgagg atgagggact tgtcatcctc 2100 

tggtactacg atgaatcgga ccgtatcact caccgcacgg tgaacggcga accggcagag 2160 

cagtggcagt atgatggcca cggctggctg acagacatca gccacctgag cgaaggccac 2220 

cgtgttgccg tccactatgg ctatgacgat aaaggccgcc tgaccggcga atgccagacg 2280 

gtggagaacc cggagacggg ggaactgctg tggcagcatg agacgaaaca cgcatacaac 234 0 

gagcaggggc tggcaaaccg cgtcacgccg gacagcctgc cgccggtgga gtggctgacg 2400 

tatggcagcg gttacctggc gggaatgaag ctgggcggga cgccgctggt cgagtatacg 24 60 

cgggacaggc tgcaccgtga gacggtgcgc agcttcggca gcatggcagg cagtaatgcc 2520 

gcatacgaac tgaccagcac atacaccccc gcaggccagt tacagagcca gcacctgaac 2580 

agcctggtat atgaccgtga ctacgggtgg agtgacaacg gcgacctggt gcgcatcagc 2640 

ggcccgcgac agacgcggga atacggctac agcgccacgg gcaggctgga gagtgtgcgc 2700 

accctcgcac cagacctgga catccgcatc ccgtatgcca cggacccggc gggcaaccgg 2760 

ctgccggacc cggagctgca cccggacagt acactcacag tgtggccgga taaccgcatc 2820 

gcggaggatg cgcactatgt ctaccgccac gatgaatacg gcaggctgac ggagaagacg 2880 
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gaccgcatcc cggcgggtgt gatacggacg gacgacgagc ggacccacca ctaccactac 2940 

gacagccagc accgcctggt gttctacacg cggatacagc atggcgagcc actggtcgag 3000 

agccgctacc tctacgaccc gctgggacgg cgaatggcaa aacgggtctg gcggcgggag 3060 

cgtgacctga cggggtggat gtcgctgtcg cgtaaaccgg aggtgacgtg gtatggctgg 3120 

gacggagaca ggctgacgac ggtgcagact gacaccacac gtatccagac ggtatacgag 3180 

ccgggaagct tcacgccgct catccgggtc gagacagaga acggcgagcg ggaaaaagcg 3240 

cagcggcgca gcctggcaga gacgctccag caggaaggga gtgagaacgg ccacggcgtg 3300 

gtgttcccgg ctgaactggt gcggctgctg gacaggctgg aggaagaaat ccgggcagac 3360 

cgcgtgagca gtgaaagccg ggcgtggctt gcgcagtgcg ggctgacggt ggagcaactg 3420 

gccagacagg tggagccgga atacacaccg gcgcgaaaag ctcatcttta tcactgcgac 3480 

caccggggac tgccgctggc gcttatcagc gaagacggca atacggcgtg gagcgcggaa 3540 

tatgatgaat ggggcaacca gcttaatgag gagaacccgc atcatgtgta tcagccgtac 3600 

cgtctgccag ggcagcagca tgatgaggaa tcagggctgt actataaccg tcaccggtac 3660 

tacgatccgt tgcaggggcg gtatattact caggacccga tggggttgaa agggggatgg 3720 

aatttatatc agtatccttt aaatccacta caacaaattg accctatggg attattgcag 3780 

acttgggatg atgccagatc tggagcatgt acggggggag tttgtggtgt tctttcacgt 3840 

ataataggac caagtaaatt tgatagtact gcagatgctg cgttagatgc tttgaaagaa 3900 

acgcagaata gatctctatg taatgatatg gaatactctg gtattgtctg taaagatact 3960 

aatggaaaat attttgcatc taaggcagaa actgataatt taagaaagga gtcatatcct 4020 

ctgaaaagaa aatgtcccac aggtacagat agagttgctg cttatcatac tcacggtgca 4080 

gatagtcatg gcgattatgt tgatgaattt ttttcaagta gcgataaaaa tcttgtaaga 4140 

agtaaagata ataatcttga agcattttat ctcgcaacac ctgatggacg atttgaggcg 4200 

cttaataata aaggagaata tatttttatc agaaatagtg tcccgggatt gagttcagta 4260 

tgcataccgt atcatgatta a 4281 

<210> 181 
<211> 369 
<212> DNA 
<213> E. Coli 



<400> 181 

atgaaatata gttcaatatt ttcgatgctt tcatttttta tactatttgc ctgtaatgag 60 

acagctgttt acggttctga tgaaaacatt atttttatga ggtatgtgga aaaattacat 120 

ttagataaat actctgttaa aaatacggta aaaactgaaa caatggcgat acaattagct 180 

gaaatatatg ttaggtatcg ctatggcgaa cggattgcag aagaagaaaa accatattta 240 

attacggaac taccagatag ttgggttgtt gagggagcaa agttacctta tgaagttgcg 300 

ggtggtgtat ttattataga aattaataag aaaaatggat gtgttttgaa tttcctacat 360 

agtaaataa 359 

<210> 182 
<211> 711 
<212> DNA 
<213> E. Coli 

<400> 182 

atgctggcgc tgatggatgc ggatggaaac attgcgtgga gcggggagta tgatgagtgg 60 

ggcaaccagc tgaatgaaga gaacccgcat cacctgcacc agccgtaccg gctgccgggg 120 

cagcagtatg ataaggagtc ggggctgtac tacaaccgga accggtacta cgatccgttg 180 

caggggcggt atatcactca ggacccgata gggctggagg ggggatggag tctgtatgcg 24 0 

tatccgctga atccggtgaa tggtattgat ccattagggt taagtcccgc agatgtagcg 300 

ctaataagaa gaaaagatca actaaaccat caaagagcat gggatatatt atctgatact 360 

tatgaagata tgaagagatt aaatttaggt gggactgatc aatttttcca ttgtatggca 420 

ttttgtcgag tgtctaaatt aaatgacgct ggtgttagcc gatcggcgaa agggctgggt 480 

tatgaaaaag agattagaga ttacgggtta aatctgttcg gtatgtacgg cagaaaagta 540 

aagctatccc attctgaaat gattgaagat aataaaaaag acttggctgt aaatgaccat 600 

gggttgacat gtccatcaac aacagattgc tcagatagat gtagtgatta tattaatcca 660 

gagcataaaa aaacgataaa ggctttacaa gatgctggct atctcaagta a 711 

<210> 183 
<211> 261 
<212> DNA 
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<213> E. Coli 
<400> 183 

atgctggcta tctcaagtaa tctatcaaag atgataatat ttatttttgc tattataatc 60 

attgttgttt tatgcgtaat tacttatctt tatttataca aagatgaatc tcttgtaagt 120 

aaacattaca taaactatat ggcaatacca gaaaatgatg gagtttttac atggctccca ISO 

gatttttttc cgcacgtagc ggtggatata tcaacataca caaatgtaga agatgattat 240 



261 



ttttttctta tttttcccta a 

<210> 184 

<211> 192 

<212> DNA 

<213> E . Coli 

<400> 184 

gtgagggcca gggaacaagt ggcgaaaatc gtatcaaaga atgatccaga tacaaaaaaa 60 

gtgtggtgta aatatggtaa gataccaggg caaggggatg gtgtaaacct tttttttgtt 120 

ggtgaaatta atgttacgca ttattttata acaaatattg gagctggatt gcctgatgct 180 

tgtgcagagt aa ^92 

<210> 185 
<211> 504 
<212> DNA 
<213> E. Coli 

<400> 185 

atgccgggca acagcccgca ttatgggcgt tggcctcaac acgattttac gtcacttaaa 60 

aaactcaggc cgcagtcggt aacctcgcgc atacagccgg gcagtgacgt catcgtctgc 120 

gcggaaatgg acgaacagtg gggctatgtc ggggctaaat cgcgccagcg ctggctgttt 180 

tacgcgtatg acagtctccg gaagacggtt gttgcgcacg tattcggtga acgcactatg 240 

gcgacgctgg ggcgtcttat gagcctgctg ccaccctttg acgtggtgat atggatgacg 300 

gatggctggc cgctgtatga atcccgcctg aagggaaagc tgcacgtaat cagcaagcga 360 

tatacgcagc gaattgagcg gcataacctg aatctgaggc agcacctggc acggctggga 420 

cggaagtcgc tgtcgttctc aaaatcggtg gagctgcatg acaaagtcat cgggcattat 480 

ctgaacataa aacactatca ataa 504 

<210> 186 
<211> 276 
<212> DNA 
<213> E. Coli 

<400> 186 

gtggcttctg tttctatcag ctgtccctcc tgttcagcta ctgacggggt ggtgcgtaac 60 

ggcaaaagca ccgccggaca tcagcgctat ctctgctctc actgccgtaa aacatggcaa 120 

ctgcagttca cttacaccgc ttctcaaccc ggtacgcacc agaaaatcat tgatatggcc 180 

atgaatggcg ttggatgccg ggcaacagcc cgcattatgg gcgttggcct caacacgatt 240 

ttacgtcact taaaaaactc aggccgcagt cggtaa 276 

<210> 187 
<211> 417 
<212> DNA 
<213> E. Coli 

<400> 187 

atgatgacta aaacccaaat aaataaatta ataaaaatga tgaatgattt agactatcca 60 

tttgaagcac cgctcaagga atcatttatt gaaagtataa tccaaataga atttaattct 120 

aattcaacta attgcctgga gaagttatgt aatgaagtta gtattctttt taagaatcaa 180 

cctgattatc ttactttttt aagagcaatg gatggattcg aagttaatgg attacgatta 240 

tttagcctct cgattccaga accttcagtt aaaaaccttt ttgccgtaaa tgaattttat 300 

agaaataatg atgatttcat aaaccctgat ctacaagaac ggttagtgat cggggattat 360 

agcatttcaa tatttactta tgacattaaa ggtgatgctg ccaacttact gatttag 417 
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<210> 188 
<211> 1179 
<212> DNA 
<213> E. Coli 

<400> 188 

atgagtaata ttgtttacct gacagtaacg ggagaacaac aaggaagcat ctccgcaggt 60 

tgtgggactt ctgagtctac aggtaatcgt tggcagagcg ggcatgagga tgaaatattt 120 

acattctcac tcttaaataa tattaataat acggggcttg gttcacagtt ccatggtata 180 

acattttgta aattaattga taaaagcact ccattattta ttaattccat taacaataat 240 

gaacaattat ttatgggatt tgacttctat cgaataaata gatttggtag attggaaaag 300 

tattattata tacaactaag aggcgctttt ttatcggcta ttcatcacca gatcattgaa 360 

aaccaactgg atacagaaac aataactatt agttatgaat ttatcctctg tcaacatctt 420 

atcgcaaata ccgagttcag ctatttggca ctccctgaaa attataaccg tttgttttta 480 

ccaaattcaa aaaaccaaac aaataatcgt ttcaaaacgt taaacagcaa agctattggc 540 

aggctacttg ctgctggtgg cgtatacaat gggaacattg aaggattcag agatactgcg 600 

gaaaaactgg gtggagatgc aataaaaggc tatgatcaaa tactaaatga aaaaacagcg 660 

ggcatagcga tagcaacagc atctattctt ttaacaaagc gttctaatgt tgatacatat 720 

acagaaataa atagttactt aggcaaactt agaggtcaac aaaaacttct tgatggtata 780 

gacataatag aaataatata cattaagaga ccttcaaaag acttagctaa cttacgaaag 840 

gagtttaata aaactgtaag aaaaaatttt cttatcaaac ttgcaaaaac ctccgaagca 900 

tctggaagat tcaacgccga agacctttta agaatgagaa agggcaatgt tcctctaaat 960 

tataatgttc accataaact atctctagat gatggtggta ctaatgattt cgaaaattta 1020 

gtattaatcg aaaacgaacc atatcataaa gtttttacta acatgcaatc acgaatagct 1080 

aagggaatat tagtaggtga aagcaaaatc actccctggg ccattccatc tggctcaatt 1140 

tatcctccca tgaaaaatat tatggaccac acaaaatga 1179 



<210> 189 
<211> 666 
<212> DNA 
<213> E. Coli 



<400> 189 

atggtacttg ctttgaacta taatatgcac ggagttaata ttcgctcaga gaatgcagca 60 

aaacctcata cgatgccctc tagatatctt tgcgagtata ttagaagcat tgagaaaaat 120 

ggccacgccc ttgattttgg ctgcggaaaa cttagatatt ctgatgaatt aatcagtaaa 180 

tttgatgaag ttacttttct agactcgaaa aggcaacttg aaagagagca aattattaga 240 

ggaattaaaa ctaaaattat tgactatgtc ccacgatatt ataaaaatgc aaatacagtt 300 

gctttcgagg atgtcgacaa aataattggg ggttacgatt tcatcctttg ctctaatgtt 360 

ctctctgccg ttccttgtcg ggatacaatc gacaaaatag ttcttagcat caagagatta 420 

ctaaaatcag gaggtgagac tcttattgta aatcaatata aaagctcata cttcaaaaaa 480 

tacgaaacag gaagaaaaca tctttacgga tacatttaca aaaattcaaa aagtgtttct 540 

tactatggat tactcgatga actcgcagtg caagaaatat gttcttcaca tggccttgaa 600 

atattaaagt cgtggagtaa agcaggaagt tcatatgtca ctgttgggag ttgtaatgca 660 

atataa 666 

<210> 190 
<211> 705 
<212> DNA 
<213> E . Coli 



<400> 190 

gtgaataata tgttcgaacc ccccaaaaat tataatgaaa tgttgcctaa acttcataaa 60 
gcaactttct taaatacgct aatatattgc atacttctag ttatttacga atacatccct • - 120 

ttaataacat taccaaccaa gtatgtccca cctattaaag atcatgagag ctttattaat 180 

tgggcactat cttttggtat attaccttgt gcttttgcga tttttgcata tttaattagc 240 

ggtgcgttag acctacataa caatgcagcc aaactacttc gggtgcgata tctttgggat 300 

aagcatctaa ttataaaacc gttatcacgg agagctggag tcaacagaaa attaaataaa 360 

gatgaagctc acaatgtaat gagcaatcta tattaccctg aagtaagaaa aattgaagac 420 

aaacattata ttgaactctt ctggaataaa gtatactatt tttggatatt ttttgaattt 480 



-64- 



WO 00/44906 



PCT/USOO/02200 



tcgataattg cattaatttc cttcctaata atcttttttt gcaaacaaat ggatattttt 540 

catgttgaag gttctttgct gtctttattc ttttttgtaa ttttatcatt ctcagtgagt 600 

ggtattatct ttgctttgac agttaagccc agaactgaaa gtcaagtcgg aaaaatcccg 660 

gacgataaaa taaaagaatt tttcactaaa aataacatta attga 705 

<210> 191 

<211> 285 

<212> DNA 

<213> E . Coli 

<400> 191 
atgtttacta tcaacgcaga 
cgtgccgcta acaagttccc 
gagctggatc acgacaaagt 
ctgaccatcg ttgttgacgg 
ccgtacaaac cgaagctgca 

<210> 192 

<211> 1977 

<212> DNA 

<213> E. Coli 

<400> 192 

atggtattgt tttatcgggc acactggcgc gactataaaa acgatcaagt gaggatcatg 60 

atgaatctga cgactctgac ccaccgcgat gcgttgtgtc tgaatgcgcg ctttaccagc 120 

cgtgaagagg ccatccacgc gttgactcaa cgtcttgctg ctctggggaa aatttccagt 180 

actgagcaat ttctggaaga agtgtatcgc cgtgaaagcc ttggcccgac cgccttaggt 240 

gaagggttgg ctgtgccgca tggcaaaact gctgcggtaa aagaagcggc gtttgcggtc 300 

gccacactca gcgagccgct tcagtgggaa ggcgttgatg gcccggaagc agttgattta 360 

gtggtgctgc tggcgattcc ccccaatgaa gcgggtacta cgcatatgca actgctgaca 420 

gcgctgacca cgcgccttgc ggatgatgag attcgggcgc gtatacagtc ggcgacgacg 480 

cctgatgagt tgctctcggc gctggatgac aagggaggca cgcaaccttc tgcctctttt 540 

tccaacgcgc caactatcgt ctgcgtaacg gcctgtccgg cgggtattgc tcacacctat 600 

atggctgcgg aatatctgga aaaagccgga cgcaaactcg gcgtaaatgt ttacgttgaa 660 

aaacaaggcg ctaacggcat tgaagggcgt ttaacggcgg atcaactcaa tagtgcaacc 720 

gcctgtattt ttgcggctga agtcgccatc aaggagagtg agcgttttaa tggcattccc 780 

gcgctttcag tgcctgttgc cgagccgatt cgccatgcag aagcgttgat ccaacaagcg 840 

cttaccctca agcgtagcga tgagacgcgt accgtacagc aagatacgca accggtgaaa 900 

agtgtcaaaa cggagctgaa acaggcactg ttgagcggaa tctcttttgc cgtaccgttg 960 

attgtcgcgg ggggcacggt gctggcggtc gcggtattac tgtcgcaaat cttcgggcta 1020 

caagatctgt ttaatgaaga aaactcctgg ctgtggatgt accgcaagct gggcggcggg 1080 

ctgctcggaa ttttgatggt accggtgctc gcggcctata ccgcctattc tctggcagat 114 0 

aaaccggcgt tagcgccagg ctttgcggct ggacttgccg ccaacatgat cggctccggg 1200 

tttctcggcg cggtcgttgg cggattgata gccggttact tgatgcgctg ggtgaaaaat 1260 

cacttgcgtc ttagcagtaa attcaatgga ttcctgactt tttatctcta cccggtgctc 1320 

ggtacgttgg gagcgggcag tctgatgctg tttgtggtgg gggaacctgt cgcctggatc 1380 

aataactcgc ttaccgcctg gctgaacggt ctgtcaggaa gtaacgcgct gttgctgggt 14 4 0 

gccattctcg gttttatgtg ttcctttgac cttggagggc cagtgaataa agccgcttat 1500 

gcattctgcc tgggcgcaat ggcgaacggc gtttacggcc cgtatgccat tttcgcctcc 1560 

gtcaaaatgg tttcggcatt taccgtaacc gcttccacga tgctcgcacc gcgcctgttt 1620 

aaagagtttg aaattgagac cgggaaatcc acctggctgt tagggctggc aggtattacc 1680 

gaaggggcga tcccgatggc gattgaagat ccgctgcggg ttattggttc gtttgtgctg 1740 

ggctctatgg taacgggcgc tattgtcggt gcgatgaata tcggcctttc gacacccggt 1800 

gccggcattt tctcgctctt tttacttcat gataatggcg cgggcggtgt tatggcggca 1860 

attggctggt ttggcgcggc attggtgggg gctgcaatct cgactgcaat tctcctgatg 1920 

tggcggcgtc acgcggttaa gcatggcaac tatctgactg atggcgtaat gccataa 1977 

<210> 193 
<211> 2634 
<212> DNA 
<213> E . Coli 



agtacgtaaa gagcagggta agggtgcgag ccgccgcctg 60 

ggcaatcatc tacggtggca aagaagcgcc gctggctatc 120 

catgaacatg caagctaaag ctgaattcta cagcgaagtt 180 

taaagaaatc aaagttaaag ctcaggacgt acagcgtcac 240 

gcacatcgac ttcgttcgcg cttaa 285 
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<400> 193 

atgaaagcag tatctcgcgt tcacatcacc ccgcatatgc actgggatcg agagtggtat 60 

ttcaccaccg aagagtcacg tattctgctg gtcaataata tggaagagat cctgtgccga 120 

ctggaacagg acaacgaata caaatattac gtactcgacg ggcaaacggc gatcctcgaa 180 

gattatttcg cggtgaaacc ggaaaacaaa gaccgtgtga agaaacaggt agaagccggc 240 

aagtcgatta tcggcccctg gtatacccag accgatacca cgattgtttc tgcggaatcc 300 

atcgtccgta atctgatgta cggaatgcgt gactgcctcg cgtttggcga gccgatgaaa 360 

ataggttatt taccagattc ctttggcatg tccgggcaac tgccgcatat ctacaatgga 420 

tttggcatta cccgcaccat gttctggcgc ggatgttcgg agcgccacgg tactgataaa 4 80 

accgagtttt tgtggcaaag cagtgacggt agcgaagtga cggcgcaggt gctgccgctg 540 

ggctacgcca tcggtaagta cttacctgcc gacgaaaacg gattacgtaa acgcctcgac 600 

agttattttg acgtgctgga aaaagcgtct gtaaccaaag agattttgct gccgaatggg 660 

catgaccaga tgccattgca gcaaaatatc ttcgaagtga tggataagct acgtgagatc 720 

taccctcaac gtaagtttgt gatgagccgc tttgaagagg tatttgagaa gatcgaagcg 780 

cagcgagata atctggcaac cctgaaaggg gaatttattg atggcaaata tatgcgcgtg 840 

catcgcacca tcggttctac gcgtatggat atcaaaattg cccacgcgcg tattgaaaat 900 

aagattgtta atctgctgga accgctggca acactggcct ggacgttggg ttttgaatac 960 

caccacggct tgctggagaa aatgtggaaa gagatcttaa aaaatcatgc ccacgacagt 1020 

atcggctgct gctgtagtga caaagttcat cgcgaaatcg tcgcccgctt cgaactggct 1080 

gaagacatgg cggataatct gattcgtttc tacatgcgca aaattgccga caacatgccg 1140 

cagagcgacg ccgacaaact cgtcctgttt aacctgatgc cctggccgcg tgaagaagtt 1200 

atcaacacca ctgtgcggct gcgcgccagc cagtttaatt tgcgggacga tcgcggtcag 1260 

cctgtaccgt attttattcg ccatgcccgt gagatcgatc caggcctaat cgatcggcaa 1320 

atagttcatt acggtaatta cgatcccttt atggagtttg atatacagat caaccagatt 1380 

gtcccttcta tgggctatcg cacgctttat atcgaagcga atcagcctgg caacgtaatt 14 40 

gcggcaaaaa gtgacgctga agggatactg gaaaatgctt tctggcaaat tgcgctcaat 1500 

gaggatggtt ctctgcaact ggtagataaa gacagcggtg tgcgctatga ccgggtattg 1560 

caaattgaag aaagctctga tgatggtgat gaatatgact attcacccgc aaaagaagag 1620 

tgggtaatta ccgcagcgaa cgcgaaaccg caatgcgata ttattcatga agcctggcag 1680 

agcagggctg ttatccgcta tgacatggca gtgccgctca atttgtcaga acgcagcgcc 174 0 

cggcaatcca ctggcagagt aggggtggtg ttggttgtca ctcttagtca taacagcagg 1800 

cgtattgatg tggatatcaa tcttgataac caggctgacg atcatcgcct tcgtgtcctg 1860 

gtccctacac cttttaacac cgacagtgtt ctggcagata cgcagtttgg ttcgctaacg 1920 

cgccccgtga acgacagtgc aatgaacaac tggcagcaag aaggctggaa agaagcgccg 1980 

gttccggtat ggaatatgct caactatgtt gccttacagg aagggcgtaa cggcatggct 2040 

gtctttagcg aagggttacg tgaatttgaa gtcatcggtg aagagaagaa aacctttgcc 2100 

attacgttgc tgcgtggcgt gggcttactg ggcaaagaag atctgctttt aaggcctggg 2160 

cggccttcgg gaattaaaat gccagtcccg gactcacaac tacgtggtct gctttcttgt 2220 

cgcctaagtt tattgagtta taccggtacg ccaaccgccg ctggtgtagc tcagcaggcg 2280 

cgagcatggc tgactccagt acagtgttac aacaaaatcc catgggatgt gatgaagctc 2340 

aacaaagccg gattcaacgt gccggaaagt tatagtttgt tgaaaatgcc cccagtggga 2400 

tgcctgataa gcgcacttaa gaaagctgaa gaccgacaag aagtgatttt acggctgttt 24 60 

aatccggctg aatcagcaac ctgtgatgcg actgttgctt tcagtcgcga ggtgatttct 2520 

tgctcagaaa cgatgatgga tgaacacatt accaccgagg aaaatcaagg ttcaaatcta 2580 

tcggggcctt ttttacccgg ccagtcacgg acgttcagtt accggcttgc ctga 2634 

<210> 194 
<211> 1572 
<212> DNA 
<213> E. Coli 

<400> 194 

atgatgttag atatagtcga actgtcgcgc ttacagtttg ccttgaccgc gatgtaccac 60 

ttcctttttg tgccactgac gctcggtatg gcgttcctgc tggccattat ggaaacggtc 120 

tacgtcctct ccggcaaaca gatttataaa gatatgacca agttctgggg caagttgttt 180 

ggtatcaact tcgctctggg tgtggctacc ggtctgacca tggagttcca gttcgggact 240 

aactggtctt actattccca ctatgtaggg gatatcttcg gtgcgccgct ggcaatcgaa 300 

ggtctgatgg ccttcttcct cgaatccacc tttgtaggtc tgttcttctt cggttgggat 360 

cgtctgggta aagttcagca tatgtgtgtc acctggctgg tggcgctcgg ttcaaacctg 4 20 

tccgcactgt ggattctggt tgcgaacggc tggatgcaaa acccaatcgc gtccgatttc 480 
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aactttgaaa 
gctcaggtga 
ctcggtatca 
tttgctatcg 
gaatccggct 
tgggaaacgc 
gagacgaaca 
gtggataccc 
aacgggatga 
gttcgtgacc 
tatacgccaa 
atcccgcgtg 
cttctggcaa 
aaatggcttc 
ggctggttcg 
gctgtggcga 
ggcctgtata 
ggcccaagca 
ccggcacgct 



ctatgcgtat 
aattcgttca 
gcgcatggta 
ctgccagctt 
acgaaatggg 
aacctgcgcc 
aatttgcgat 
cggttatcgg 
aggcgtactc 
agttcaatag 
acgtggctga 
tagcgccgct 
tcatcgcgct 
tgcgcgccgc 
tggctgaata 
actcgtcact 
ccctgttcct 
gcctgaaaac 
aa 



ggagatggtg 
cactgtagcg 
tatgctgaaa 
cggtatggct 
cgacgtgcag 
tgctgccttt 
tcagatccct 
cctgaaagag 
tctgctcgaa 
catgaagaaa 
tgcgactgaa 
gtactttgcg 
ctctttctgg 
gctgtacggt 
tggccgccaa 
gaccgcaggc 
ggtggcagaa 
cggtcgctat 



agcttctccg 
tctggttatg 
ggtcgtgact 
gctgttctgt 
aaaaccaaac 
actctgttcg 
tacgcactgg 
ctgatggtgc 
caactgcgtt 
gacctcggtt 
gcgcagattc 
ttccgtatca 
agtgtcatcc 
attccgctgc 
ccgtgggcta 
gatctcatct 
ttgttcttaa 
cactttgagc 



agctggtgct 
tgactggcgc 
tcgccttcgc 
ctgttattgt 
tggctgctat 
gcattcctga 
gcatcattgc 
agcatgaaga 
ctggttctac 
acggtctgct 
aacaggcaac 
tggtggcgtg 
gcaaccgcat 
cgtggattgc 
tcggtgaagt 
tctcaatggt 
tgttcaagtt 
agtcttccac 



taacccggtt 
gatgttcatc 
taaacgctcc 
tctgggtgat 
tgaagccgag 
tcaggaagag 
aacgcgttcc 
acgcattcgt 
cgaccaggcg 
gctgaaacgc 
caaagactcc 
tggcttcctg 
tggcgagaaa 
tgtagaagcg 
gctgccgaca 
gctgatttgc 
tgcacgcctc 
gactactcag 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1572 



<210> 195 
<211> 1140 
<212> DNA 
<213> E. Coli 



<400> 195 

atgatcgatt atgaagtatt gcgttttatc tggtggctgc tggttggcgt tctgctgatt 60 

ggttttgcag tcactgacgg tttcgacatg ggggtgggca tgctcacccg tttcctcggt 120 

cgtaacgaca ccgagcgtcg aattatgatt aactccattg caccacactg ggacggtaac 180 

caggtttggc tgatcaccgc gggcggcgca ctctttgctg cctggccgat ggtctatgcc 24 0 

gctgcgttct ccggcttcta tgtggcgatg atcctcgtgc tggcgrcttt gttcttccgt 300 

ccggtcggtt ttgactaccg ctccaagatt gaagaaaccc gctggcgtaa catgtgggac 360 

tggggcatct tcattggtag cttcgttccg ccgctggtaa ttggtgtagc gttcggtaac 420 

ctgttgcagg gcgtaccgtt caacgttgat gaatatctgc gtctgtacta caccggtaac 4 80 

ttcttccagt tgcttaaccc gttcggcctg ctggcaggcg tggtgagcgt agggatgatc 540 

attactcagg gcgcaaccta tctgcaaatg cgtaccgtgg gcgaactgca cctgcgtacc 600 

cgtgcaacgg ctcaggtggc tgcgctggtg acactggtct gtttcgcact ggctggcgta 660 

tgggtgatgt acggtatcga tggttatgtc gtgaaatcga caatggacca ttacgcagcc 720 

tctaacccac tgaataaaga agtggttcgt gaagctggcg catggctggt taacttcaac 780 

aacacgccaa ttctgtgggc tattccggca ctgggtgtgg ttctgccgct gctgaccatc 840 

ctgactgcac gtatggataa agccgcgtgg gcgtttgtgt tctcctccct gacgctggcc 900 

tgcatcatcc tgacagccgg tatcgcaatg ttcccgtttg tgatgccgtc cagcaccatg 960 

atgaacgcaa gtctgacaat gtgggatgca acttccagcc agctgacgct taacgtcatg 1020 

acctgggttg cggtggttct ggtaccgatc attctgctct acaccgcctg gtgttactgg 1080 

aaaatgttcg gtcgtatcac caaagaagat attgaacgta acacccactc tctgtactaa 1140 

<210> 196 
<211> 1371 
<212> DNA 
<213> E. Coli 



<400> 196 

atggaattat cctcactgac cgccgtttcc cctgtcgatg gacgctacgg cgataaagtc 60 

agcgcgctgc gcgggatttt cagcgaatat ggtttgctga aattccgtgt acaagttgaa 120 

gtacgttggc tgcaaaaact ggccgcgcac gcagcgatca aggaagttcc tgcttttgct 180 

gccgacgcaa tcggttacct tgatgcaatc gtcgccagtt tcagcgaaga agatgcggcg 24 0 

cgcatcaaaa ctatcgagcg taccactaac cacgacgtta aagcggttga gtatttcctg 300 

aaagaaaaag tggcggagat cccggaactg cacgcggttt ctgaattcat ccactttgcc 360 

tgtacttcgg aagatatcaa taacctctcc cacgcattaa tgctgaaaac cgcgcgtgat 420 

gaagtgatcc tgccatactg gcgtcaactg attgatggca ttaaagatct cgccgttcag 480 

tatcgcgata tcccgctgct gtctcgtacc cacggtcagc cagccacgcc gtcaaccatc 54 0 
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ggtaaagaga 
gtggagatcc 
tacccggaag 
tggaacccgt 
gttgcgcgct 
cttaaccact 
aaagttaacc 
ttgcagcatc 
accgtgctgc 
ctgaaaggcg 
aactgggaag 
ccgtacgaga 
cagtttatcg 
ccggctaact 



tggcaaacgt 
tcggcaaaat 
ttgactggca 
acaccaccca 
tcaacactat 
tcaaacagaa 
cgatcgactt 
tggcaagcaa 
gtaacctcgg 
tgagcaaact 
tgctggctga 
agctgaaaga 
atggtctggc 
atattggtcg 



cgcctaccgt 
caacggcgcg 
tcagttcagc 
gatcgaaccg 
tctgatcgac 
aaccattgct 
cgaaaactcc 
actgccggtt 
cgtgggtatc 
ggaagtgaac 
accaatccag 
gctgactcgc 
gttgccagaa 
agctatcacg 



atggagcgcc 
gtcggtaact 
gaagagttcg 
cacgactaca 
tttgaccgtg 
ggtgagattg 
gaagggaatc 
tcccgctggc 
ggttatgcct 
cgtgaccatc 
acagttatgc 
ggtaagcgcg 
gaagagaaag 
atggttgatg 



agtaccgcca 
ataacgccca 
tcacctcgct 
ttgccgaact 
acgtctgggg 
gttcttccac 
tgggcctttc 
agcgtgacct 
tgattgcata 
tgctggatga 
gtcgctatgg 
ttgacgccga 
cccgcctgaa 
agctgaaata 



gcttaaccag 
catcgccgct 
gggtactcag 
gtttgattgc 
ttatatcgcc 
catgccgcat 
caacgcggta 
gaccgactct 
tcaatccacc 
actggatcac 
catcgaaaaa 
aggcatgaag 
agcgatgacg 



<210> 197 
<211> 186 
<212> DNA 
<213> E. Coli 

<400> 197 

atgctgattc tgactcgtcg agttggtgag accctcatga ttggggatga ggtcaccgtg 
acagttttag gggtaaaggg caaccaggta cgtattggcg taaatgcccc gaaggaagtt 
tctgttcacc gtgaagagat ctaccagcgt atccaggctg aaaaatccca gcagtccagt 
tactaa 

<210> 198 
<211> 93 
<212> DNA 
<213> E. Coli 

<400> 198 

ggtgaggtgg ccgagaggct gaaggcgctc ccctgctaag ggagtatgcg gtcaaaagct 
gcatccgggg ttcgaatccc cgcctcaccg cca 

<210> 199 

<211> 603 

<212> DNA 

<213> E. Coli 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1371 



60 
120 
180 
186 



-60 
93 



<400> 199 
atgaagaata aggctgataa 
ctccttaaag cagcaaatac 
tgttttattc atggtttccg 
cttaaggcaa agtgtatata 
ctattgaata aagaagttca 
catgctgaga gcgagtgggt 
ttttaccata ttatctcgac 
cacatgttac gccattcgtg 
atccaggatt atcttgggca 
gcagggcgtt tttacggcat 
tag 

<210> 200 
<211> 597 
<212> DNA 
<213> E. Coli 



<400> 200 

gtgagtaaac gtcgttatct taccggtaaa gaagttcagg ccatgatgca ggcggtttgt 
tacggggcaa cgggagccag agattattgt cttattctgt tggcatatcg gcatgggatg 



caaaaaaagg 
cgggcctcat 
ggcgagtgaa 
tatccatcga 
ggctttaaaa 
atttttatca 
ttccggtggt 
tggttttgct 
tcgcaatatt 
ctgggataga 



aacttcctga 
gcagcacgta 
atttgtcgat 
ttaaaaaaag 
aactggttga 
cgtaagggga 
aatgccgggt 
ttggcgaata 
cgtcatactg 
gccagaggac 



cccatagtga 
attattgtct 
tgaggatttc 
gcttttcaac 
gtatccgtac 
atccgctttc 
tgtcactgga 
tgggaataga 
tctggtatac 
gacagcgtca 



aatcgaatca 
gactttgctt 
ggatattgat 
aacgcacccg 
ttcgtacccg 
tcggcaacag 
gattcatccg 
tacgcgactt 
cgccagcaat 
cgctgtttta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
603 



60 
120 
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cgtattagtg aactgcttga tctgcattat caggaccttg accttaatga aggtagaata 180 

aatattcgcc gactgaagaa cggattttct accgttcacc cgttacgttt tgatgagcgt 240 

gaagccgtgg aacgctggac ccaggaacgt gctaactgga aaggcgctga ccggactgac 300 

gctatattta tttctcgccg cgggagtcgg ctttctcgcc agcaggccta tcgcattatt 360 

cgcgatgccg gtattgaagc tggaaccgta acgcagactc atcctcatat gttaaggcat 420 

gcttgcggtt atgaattggc ggagcgtggt gcagatactc gtttaattca ggattatctc 480 

gggcatcgaa atattcgcca tactgtgcgt tataccgcca qtaatgctgc tcgttttgcc 540 

ggattatggg aaagaaataa tctcataaac gaaaaattaa aaagagaaga ggtttga 597 

<210> 201 
<211> 549 
<212> DNA 
<213> E. Coli 

<400> 201 

atgaaaatta aaactctggc aatcgttgtt ctgtcggctc tgtccctcag ttctacagcg 60 

gctctggccg ctgccacgac ggttaatggt gggaccgttc actttaaagg ggaagttgtt 120 

aacgccgctt gcgcagttga tgcaggctct gttgatcaaa ccgttcagtt aggacaggtt 180 

cgtaccgcat cgctggcaca ggaaggagca accagttctg ctgtcggttt taacattcag 240 

ctgaatgatt gcgataccaa tgttgcatct aaagccgctg ttgccttttt aggtacggcg 300 

attgatgcgg gtcataccaa cgttctggct ctgcagagtt cagctgcggg tagcgcaaca 360 

aacgttggtg tgcagatcct ggacagaacg ggtgctgcgc tgacgctgga tggtgcgaca 420 

tttagttcag aaacaaccct gaataacgga accaatacca ttccgttcca ggcgcgttat 480 

tttgcaaccg gggccgcaac cccgggtgct gctaatgcgg atgcgacctt caaggttcag 540 

tatcaataa " 549 

<210> 202 
<211> 648 
<212> DNA 
<213> E. Coli 

<400> 202 

gtgctgctaa tgcggatgcg accttcaagg ttcagtatca ataacctacc caggttcagg 60 

gacgtcatta cgggcaggga tgcccaccct tgtgcgataa aaataacgat gaaaaggaag 120 

agattatttc tattagcgtc gttgctgcca atgtttgctc tggccggaaa taaatggaat 180 

accacgttgc ccggcggaaa tatgcaattt cagggcgtca ttattgcgga aacttgccgg 24 0 

attgaagccg gtgataaaca aatgacggtc aatatggggc aaatcagcag taaccggttt 300 

catgcggttg gggaagatag cgcaccggtg ccttttgtta ttcatttacg ggaatgtagc 360 

acggtggtga gtgaacgtgt aggtgtggcg tttcacggtg tcgcggatgg taaaaatccg 420 

gatgtgcttt ccgtgggaga ggggccaggg atagccacca atattggcgt agcgttgttt 480 

gatgatgaag gaaacctcgt accgattaat cgtcctccag caaactggaa acggctttat 540 

tcaggctcta cttcgctaca tttcatcgcc aaatatcgtg ctaccgggcg tcgggttact 600 

ggcggcatcg ccaatgccca ggcctggttc tctttaacct atcagtaa 64 8 

<210> 203 
<211> 726 
<212> DNA 
<213> E . Coli 

<400> 203 

gtgagtaata aaaacgtcaa tgtaaggaaa tcgcaggaaa taacattctg cttgctggca 60 

ggtatcctga tgttcatggc aatgatggtt gccggacgcg ctgaagcggg agtggcctta 120 

ggtgcgactc gcgtaattta tccggcaggg caaaaacaag agcaacttgc cgtgacaaat 180 

aatgatgaaa atagtaccta tttaattcaa tcatgggtgg aaaatgccga tggtgtaaag 240 

gatggtcgtt ttatcgtgac gcctcctctg tttgcgatga agggaaaaaa agagaatacc 300 

ttacgtattc ttgatgcaac aaataaccaa ttgccacagg accgggaaag tttattctgg 360 

atgaacgtta aagcgattcc gtcaatggat aaatcaaaat tgactgagaa tacgctacag 420 

ctcgcaatta tcagccgcat taaactgtac tatcgcccgg ctaaattagc gttgccaccc 480 

gatcaggccg cagaaaaatt aagatttcgt cgtagcgcga attctctgac gctgattaac 540 

ccgacaccct attacctgac ggtaacagag ttgaatgccg gaacccgggt tcttgaaaat 600 

gcattggtgc ctccaatggg cgaaagcacg gttaaattgc cttctgatgc aggaagcaat 660 
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attacttacc gaacaataaa 
gaataa 



tgattatggc gcacttaccc ccaaaatgac gggcgtaatg 



720 
726 



<210> 204 
<211> 2637 
<212> DNA 
<213> £. Coli 

<400> 204 

atgtcatatc tgaatttaag actttaccag cgaaacacac aatgcttgca tattcgtaag 60 

catcgtttgg ctggtttttt tgtccgactc gttgtcgcct gtgcttttgc cgcacaggca 120 

cctttgtcat ctgccgacct ctattttaat ccgcgctttt tagcggatga tccccaggct 180 

gtggccgatt tatcgcgttt tgaaaatggg caagaattac cgccagggac gtatcgcgtc 24 0 

gatatctatt tgaataatgg ttatatggca acgcgtgatg tcacatttaa tacgggcgac 300 

agtgaacaag ggattgttcc ctgcctgaca cgcgcgcaac tcgccagtat ggggctgaat 360 

acggcttctg tcgccggtat gaatctgctg gcggatgatg cctgtgtgcc attaaccaca 420 

atggtccagg acgctactgc gcatctggat gttggtcagc agcgactgaa cctgacgatc 480 

cctcaggcat ttatgagtaa tcgcgcgcgt ggttatattc ctcctgagtt atgggatccc 540 

ggtattaatg ccggattgct caattataat ttcagcggaa atagtgtaca gaatcggatt 600 

gggggtaaca gccattatgc atatttaaac ctacagagtg ggttaaatat tggtgcgtgg 660 

cgtttacgcg acaataccac ctggagttat aacagtagcg acagatcatc aggtagcaaa 720 

aataaatggc agcatatcaa tacctggctt gagcgagaca taataccgtt acgttcccgg 780 

ctgacgctgg gtgatggtta tactcagggc gatattttcg atggtattaa ctttcgcggc 840 

gcacaattgg cctcagatga caatatgtta cccgatagtc aaagaggatt tgccccggtg 900 

atccacggta ttgctcgtgg tactgcacag gtcactatta aacaaaatgg gtatgacatt 960 

tataatagta cggtgccacc ggggcctttt accatcaacg atatctatgc cgcaggtaat 1020 

agtggtgact tgcaggtaac gatcaaagag gctgacggca gcacgcagat ttttaccgta 1080 

ccctattcgt cagtcccgct tttgcaacgt gaagggcata ctcgttattc cattacggca 1140 

ggagaatacc gtagtggaaa tgcgcagcag gaaaaaaccc gctttttcca gagtacatta 1200 

ctccacggcc ttccggctgg ctggacaata tatggtggaa cgcaactggc ggatcgttat 1260 

cgtgctttta atttcggtat cgggaaaaac atgggggcac tgggcgctct gtctgtggat 1320 

atgacgcagg ctaattccac acttcccgat gacagtcagc atgacggaca atcggtgcgt 1380 

tttctctata acaaatcgct caatgaatca ggcacgaata ttcagttagt gggttaccgt 1440 

tattcgacca gcggatattt taatttcgct gatacaacat acagtcgaat gaatggctac 1500 

aacattgaaa cacaggacgg agttattcag gttaagccga aattcaccga ctattacaac 1560 

ctcgcttata acaaacgcgg gaaattacaa ctcaccgtta ctcagcaact cgggcgcaca 1620 

tcaacactgt atttgagtgg tagccatcaa acttattggg gaacgagtaa tgtcgatgag 1680 

caattccagg ctggattaaa tactgcgttc gaagatatca actggacgct cagctatagc 1740 

ctgacgaaaa acgcctggca aaaaggacgg gatcagatgt tagcgcttaa cgtcaatatt 1800 

cctttcagcc actggctgcg ttctgacagt aaatctcagt ggcgacatgc cagtgccagc 1860 

tacagcatgt cacacgatct caacggtcgg atgaccaatc tggctggtgt atacggtacg 1920 

ttgctggaag acaacaacct cagctatagc gtgcaaaccg gctatgccgg gggaggcgat .1980 

ggaaatagcg gaagtacagg ctacgccacg ctgaattatc gcggtggtta cggcaatgcc 2040 

aatatcggtt acagccatag cgatgatatt aagcagctct attacggagt cagcggtggg 2100 

gtactggctc atgccaatgg cgtaacgctg gggcagccgt taaacgatac ggtggtgctt 2160 

gttaaagcgc ctggcgcaaa agatgcaaaa gtcgaaaacc agacgggggt gcgtaccgac 2220 

tggcgtggtt atgccgtgct gccttatgcc actgaatatc gggaaaatag agtggcgctg 2280 

gataccaata ccctggctga taacgtcgat ttagataacg cggttgctaa cgttgttccc 2340 

actcgtgggg cgatcgtgcg agcagagttt aaagcgcgcg ttgggataaa actgctcatg 2400 

acgctgaccc acaataataa gccgctgccg tttggggcga tggtgacatc agagagtagc 2460 

cagagtagcg gcattgttgc ggataatggt caggtttacc tcagcggaat gcctttagcg 2520 

ggaaaagttc aggtgaaatg gggagaagag gaaaatgctc actgtgtcgc caattatcaa 2580 

ctgccaccag agagtcagca gcagttatta acccagctat cagctgaatg tcgttaa 2637 

<210> 205 

<211> 531 
<212> DNA 
<213> E. Coli 

<400> 205 

atgagaaaca aaccttttta tcttctgtgc gcttttttgt ggctggcggt gagtcacgct 60 
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ttggctgcgg atagcacgat tactatccgc ggctatgtca gggataacgg ctgtagtgtg 120 

gccgctgaat caaccaattt tactgttgat ctgatggaaa acgcggcgaa gcaatttaac 180 

aacattggcg cgacgactcc tgttgttcca tttcgtattt tgctgtcacc ctgtggtaat 240 

gccgtttctg ccgtaaaggt tgggtttact ggcgttgcag atagccacaa tgccaacctg 300 

cttgcacttg aaaatacggt gtcagcggct tcgggactgg gaatacagct tctgaatgag 360 

cagcaaaatc aaatacccct taatgctcca tcgtccgcgc tttcgtggac gaccctgacg 420 

ccgggtaaac caaatacgct gaatttttac gcccggctaa tggcgacaca ggtgcctgtc 480 

actgcggggc atatcaatgc cacggctacc ttcactcttg aatatcagta a 531 

<210> 206 
<211> 504 
<212> DNA 
<213> E. Coli 



<400> 206 

atgaaatggt gcaaacgtgg gtatgtattg gcggcaatat tggcgctcgc aagtgcgacg 60 

atacaggcag ccgatgtcac catcacggtg aacggtaagg tcgtcgccaa accgtgtacg 120 

gtttccacca ccaatgccac ggttgatctc ggcgatcttt attctttcag tcttatgtct 180 

gccggggcgg catcggcctg gcatgatgtt gcgcttgagt tgactaattg tccggtggga 240 

acgtcgaggg tcactgccag cttcagcggg gcagccgaca gtaccggata ttataaaaac 300 

caggggaccg cgcaaaacat ccagttagag ctacaggatg acagtggcaa cacattgaat 360 

actggcgcaa ccaaaacagt tcaggtggat gattcctcac aatcagcgca cttcccgtta 420 

caggtcagag cattgacagt aaatggcgga gccactcagg gaaccattca ggcagtgatt 4 80 

agcatcacct atacctacag ctga 504 

<210> 207 

<211> 903 

<212> DNA 

<213> E. Coli 



<400> 207 

atgaaacgag ttattaccct gtttgctgta ctgctgatgg gctggtcggt aaatgcctgg 60 

tcattcgcct gtaaaaccgc caatggtacc gctatcccta ttggcggtgg cagcgccaat 120 

gtttatgtaa accttgcgcc cgtcgtgaat gtggggcaaa acctggtcgt ggatctttcg 180 

acgcaaatct tttgccataa cgattatccg gaaaccatta cagactatgt cacactgcaa 240 

cgaggctcgg cttatggcgg cgtgttatct aatttttccg ggaccgtaaa atatagtggc 300 

agtagctatc catttcctac caccagcgaa acgccgcgcg ttgtttataa ttcgagaacg 3 60 

gataagccgt ggccggtggc gctttatttg acgcctgtga gcagtgcggg cggggtggcg 420 

attaaagctg gctcattaat tgccgtgctt attttgcgac agaccaacaa ctataacagc 480 

gatgatttcc agtttgtgtg gaatatttac gccaataatg atgtggtggt gcctactggc 540 

ggctgcgatg tttctgctcg tgatgtcacc gttactctgc cggactaccc tggttcagtg 600 

ccaattcctc ttaccgttta ttgtgcgaaa agccaaaacc tggggtatta cctctccggc 660 

acaaccgcag atgcgggcaa ctcgattttc accaataccg cgtcgttttc acctgcacag 720 

ggcgtcggcg tacagttgac gcgcaacggt acgattattc cagcgaataa cacggtatcg 780 

ttaggagcag tagggacttc ggcggtgagt ctgggattaa cggcaaatta tgcacgtacc 840 

ggagggcagg tgactgcagg gaatgtgcaa tcgattattg gcgtgacttt tgtttatcaa 900 

taa 903 

<210> 208 
<211> 1631 
<212> DNA 
<213> E. Coli 



<400> 208 

gtgctgtcaa aactaccccg tagactccga tcttttcaaa catattgcac catccgtgta 60 

catcggggtg aggatatgaa atcaatggat aagttaacaa caggtgttgc ctatggcaca 120 

tcggcgggta atgctggttt ctgggcattg cagttactcg ataaagtaac tccgtcacag 180 

tgggctgcaa tcggtgtgct gggtagcctg gtttttggcc tgctgacgta tctgacaaat 240 

ctttatttca agattaaaga agacaggcgt aaggctgcga gaggagagta atccaatgac 300 

tcaagactat gaactggttg tgaaaggagt ccgtaatttt gagaataaag ttacggtaac 360 

tgtagcctta caggacaaag aacgctttga cggtgaaatt tttgacctgg atgtcgccat 420 
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ggaccgtgtt 
gcaagtcttc 
tcagtactcc 
tctatgaaca 
ggcgcatctg 
atggcatacc 
ggaaaaaccg 
attgagcgtg 
ccacaaaaag 
ccgtcgacgt 
cgctggtgga 
caggttattc 
agatattcat 
accgaggatg 
atgctcaatt 
ttaaagatgc 
gtgttaacca 
ataccgcaga 
tcaccaacat 
aacgctacat 
atcaaatgta 



gaaggagctg 
ctggaagtag 
tttaagattg 
caaaaatcag 
ctcctcagat 
gcgatggttc 
tttttcccaa 
ataaggcgct 
cgggtatcgc 
tttataagcg 
ttaaggatgg 
gtcgtgacca 
ggtgattttt 
gcaaaaaaaa 
tgctgctcga 
gcaacagaaa 
gttgcgtgcc 
tcttgccgct 
gctcggagat 
cgcaggagtg 

g 



cgctggagtt 
cagaaaaatt 
aaaatcctgc 
atacggcctg 
acttgaccag 
tggcatatgg 
tatgaaactg 
ggcatgggtg 
gtcattttgt 
gctgaatgct 
cggacgcgat 
ggagagcgca 
ctcgtgttgt 
tgggcggaac 
ataattgaac 
tctgctgaaa 
gaagcaaaaa 
gccgtcagag 
attgcagcag 
acttgtcaac 



ttatgaggca 
gtcagaaaaa 
caataagcac 
tcggctgccg 
tttctggacg 
accatctgtc 
tcgaaggaaa 
gagcgcaata 
ccctataaca 
ggtgatcgta 
tgccgcattc 
ttaacctgct 
caggatttat 
gtgatgctgc 
aggggcgaac 
tttctgccag 
aatatgccat 
gcaaaacaac 
aagctcagct 
agatctatga 



gcagccagaa 
gttgagtctt 
gagcgtcctc 
ttctggcgct 
aaaaagaagg 
ggggtgccac 
aatgcgacca 
ttaaagtacc 
ttggccccgg 
aaggtgcatg 
gttcaaataa 
gggggataga 
cgtcggaaat 
cgcattatca 
tatagcccgt 
ggctgcttat 
acgccttgac 
caaaaccgcc 
ttatgctgaa 
atctttaaga 



ggagcgtccg 
atctgcagca 
atcataaata 
gattggtgct 
taaccacaca 
agtggtggat 
ggtcaacgcc 
actgaccgaa 
taagtgtttc 
cgaagcgatt 
ctgttacggt 
acagtgaatc 
gtctggagcg 
caagaggtaa 
gatgaggctg 
ctgtctgata 
gcagcgaagc 
gaaggaatgc 
attgctgacg 
gataaaaagc 



<210> 209 
<211> 534 
<212> DNA 
<213> E. Coli 

<400> 209 
atgaacacaa aaatcagata 
gcatctgctc ctcagatact 
gcataccgcg atggttctgg 
aaaaccgttt ttcccaatat 
gagcgtgata aggcgctggc 
caaaaagcgg gtatcgcgtc 
tcgacgtttt ataagcggct 
tggtggatta aggatggcgg 
gttattcgtc gtgaccagga 

<210> 210 
<211> 312 
<212> DNA 
<213> E. Coli 

<400> 210 
atgactcaag actatgaact 
gtaactgtag ccttacagga 
gccatggacc gtgttgaagg 
gtccggcaag tcttcctgga 
cagcatcagt actcctttaa 
aaatatctat ga 

<210> 211 

<211> 291 

<212> DNA 

<213> E. Coli 



cggcctgtcg 
tgaccagttt 
catatggacc 
gaaactgtcg 
atgggtggag 
attttgtccc 
gaatgctggt 
acgcgattgc 
gagcgcatta 



gctgccgttc 
ctggacgaaa 
atctgtcggg 
aaggaaaaat 
cgcaatatta 
tataacattg 
gatcgtaaag 
cgcattcgtt 
acctgctggg 



tggcgctgat 
aagaaggtaa 
gtgccacagt 
gcgaccaggt 
aagtaccact 
gccccggtaa 
gtgcatgcga 
caaataactg 
ggatagaaca 



tggtgctggc 
ccacacaatg 
ggtggatgga 
caacgccatt 
gaccgaacca 
gtgtttcccg 
agcgattcgc 
ttacggtcag 
gtga 



ggttgtgaaa 
caaagaacgc 
agctgcgctg 
agtagcagaa 
gattgaaaat 



ggagtccgta 
tttgacggtg 
gagttttatg 
aaattgtcag 
cctgccaata 



attttgagaa 
aaatttttga 
aggcagcagc 
aaaaagttga 
agcacgagcg 



taaagttacg 
cctggatgtc 
cagaaggagc 
gtcttatctg 
tcctcatcat 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1631 



60 
120 
180 
240 
300 
360 
420 
480 
534 



60 
120 
180 
240 
300 
312 



<400> 211 

gtgctgtcaa aactaccccg tagactccga tcttttcaaa catattgcac catccgtgta 60 

catcggggtg aggatatgaa atcaatggat aagttaacaa caggtgttgc ctatggcaca 120 

tcggcgggta atgctggttt ctgggcattg cagttactcg ataaagtaac tccgtcacag 180 

tgggctgcaa tcggtgtgct gggtagcctg gtttttggcc tgctgacgta tctgacaaat 240 
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ctttatttca agattaaaga agacaggcgt aaggctgcga gaggagagta a 291 

<210> 212 

<211> 216 

<212> DNA 

<213> E . Coli 



<400> 212 

atgtcaaata aaatgactgg tttagtaaaa tggtttaacg ctgataaagg tttcggcttt 60 

atttctcctg ttgatggtag taaagatgtg tttgtgcatt tttctgcgat tcagaatgat 120 

aattatcgaa ccttatttga aggtcaaaag gttaccttct ctatagagag tggtgctaaa 180 

ggtcctgcag cagcaaatgt catcattact gattaa 216 

<210> 213 
<211> 1017 
<212> DNA 
<213> C . Coli 



<400> 213 

atgtttgtca tctggagcca tagaacaggg ttcatcatga gtcatcaact taccttcgcc 60 

gacagtgaat tcagcagtaa gcgccgtcag accagaaaag agattttctt gtcccgcatg 120 

gagcagattc tgccatggca aaacatggtg gaagtcatcg agccgtttta ccccaaggct 180 

ggtaatggcc ggcgacctta tccgctggaa accatgctac gcattcactg catgcagcat 240 

tggtacaacc tgagcgatgg cgcgatggaa gatgctctgt acgaaatcgc ctccatgcgt 300 

ctgtttgccc ggttatccct ggatagcgcc ttgccggacc gcaccaccat catgaatttc 360 

cgccacctgc tggagcagca tcaactggcc cgccaattgt tcaagaccat caatcgctgg 420 

ctggccgaag caggcgtcat gatgactcaa ggcaccttgg tcgatgccac catcattgag 480 

gcacccagct cgaccaagaa caaagagcag caacgcgatc cggagatgca tcagaccaag 54 0 

aaaggcaatc agtggcactt tggcatgaag gcccacattg gtgtcgatgc caagagtggc 600 

ctgacccaca gcctggtcac caccgcggcc aacgagcatg acctcaatca gctgggtaat 660 

ctgctgcatg gagaggagca atttgtctca gccgatgccg gctaccaagg ggcgccacag 720 

cgcgaggagc tggccgaggt ggatgtggac tggctgatcg ccgagcgccc cggcaaggta 780 

agaaccttga aacagcatcc acgcaagaac aaaacggcca tcaacatcga atacatgaaa 840 

gccagcatcc gggccagggt ggagcaccca tttcgcatca tcaagcgaca gttcggcttc 900 

gtgaaagcca gatacaaggg gttgctgaaa aacgataacc aactggcgat gttattcacg 960 

ctggccaacc tgtttcgggc ggaccaaatg atacgtcagt gggagagatc tcactaa 1017 



<210> 214 
<211> 474 
<212> DNA 
<213> E. Coli 



<400 
atggtatata 
aatcttaatg 
ttattgaaac 
ggctttggtc 
gatgatgatt 
acatatatta 
gatgaagcga 
attgtgtcat 



> 214 
taataatcgt 
ctgacgatga 
aaatatgcca 
ataataataa 
acattttgtt 
aatatgtcga 
aatctttaca 
ttatgttagg 



ttcccacgga 
gcactacaag 
gcattatgca 
tattgcggtg 
tttgaatccc 
aagtaagcgt 
tgattattcc 
gattaaggaa 



catgaagact 
attatcgtac 
ggcctggact 
gcgtatgtaa 
gatatcatca 
tatgctttta 
gtaagaaaat 
ggtgcgaaca 



acatcaaaaa 
gcgacaacaa 
atattagtgg 
aggaaaaata 
tgaagcatga 
gtacattatg 
ttcctgtgct 
agtccctgat 



attactcgaa 
agactctcta 
aggtgtatac 
tagacccgca 
tgatttgctg 
cctgttccga 
ttctgatttt 
atga 



60 
120 
180 
240 
300 
360 
420 
474 



<210> 215 
<211> 1119 
<212> DNA 
<213> E. Coli 



<400> 215 

atgggaaaaa gcatagtcgt tgtttctgcg gtcaatttta ccactggcgg tccatttacc 60 

attttgaaaa aatttttggc agcaactaat aataaagaaa atgtcagttt tatcgcatta 120 

gtccattctg ctaaagagtt aaaagaaagt tatccatggg ttaaattcat tgagtttcct 180 
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gaggttaaag ggtcgtggct aaaacgtttg cactttgaat atgtagtttg taaaaaactt 240 

tcaaaagagc tgaatgctac gcattggatt tgtctgcatg atattacggc caatgtcgtc 300 

actaaaaaaa gatatgtgta ttgtcataac cctgcacctt tttataaagg aattttattc 360 

cgtgaaattc ttatggagcc tagctttttc ttatttaaaa tgctatacgg gctgatatat 420 

aaaataaaca ttaaaaaaaa tactgcagtg tttgttcaac aattctggat gaaagaaaaa 480 

tttatcaaga aatattctat aaataacatc attgtcagtc ggccagaaat taaattatct 540 

gataaaagcc aacttactga tgatgattct caatttaaga ataacccttc tgagttgaca 600 

atattttacc ctgctgttcc acgagtattt aaaaattacg agcttattat tagtgcagca 660 

aggaaattga aagaacaatc caatattaaa tttctgctta ctatcagtgg tacagaaaat 720 

gcgtatgcaa aatatattat cagtcttgca gaaggactgg ataatgttca tttcctcggg 780 

tacttggata aagaaaaaat cgatcattgt tataatattt cagatatagt ttgttttccc 840 

tctaggttag aaacatgggg attgccgttg tctgaagcta aagagcgagg taagtgggta 900 

ttagcatcag atttcccatt tactagagaa actcttggta gttatgaaaa gaaagctttt 960 

tttgattcta ataacgatga catgttagtt aaacttatta ttgacttcaa aaaaggtaac 1020 

ctcaaaaaag atatctctga tgcaaatttc atttatcgta atgaaaatgt attagttggg 1080 

tttgatgaac tagttaattt tattactgaa gaacattga 1119 

<210> 216 
<211> 591 
<212> DNA 
<213> E. Coli 



<400> 216 

atgatcttaa aactcgctaa acgatatggt ctctgtggtt ttattcggct tgttagagat 60 

gtcttattga ctcgtgtatt ttaccggaac tgtagaatta ttcgatttcc ctgctatatt 120 

cgcaatgatg gtagcattaa ttttggtgaa aatttcacaa gtggagtcgg tctcaggctg 180 

gatgcatttg gacgtggcgt gatttttttt tccgataatg tgcaagttaa cgactatgtt 240 

catatcgcct caattgagag cgttacgata ggtcgggata cgcttattgc aagtaaagta 300 

tttattaccg atcataatca cggttccttt aagcactctg atccaatgag ttcgccaaat 360 

atacctccag acatgcgcac gttggaatct tcagctgttg taattggcca gagggtttgg 420 

ttgggtgaga atgtgacggt tttgcctgga acaattattg gtaatggagt cgtagtcggc 4 80 
gccaattctg ttgttagagg ttctattccc gaaaatactg tcattgcggg agtaccagca . 540 

aaaatcataa agaaatacaa tcatgagacc aaattatggg aaaaagcata g 591 

<210> 217 
<211> 993 
<212> DNA 
<213> E. Coli 

<400> 217 

atgtattttt tgaatgattt aaatttctct agacgcgatg ctggatttaa agcaagaaaa 60 

gatgcactgg acattgcttc agattatgaa aacatttctg ttgttaacat tcctctatgg 120 

ggtggagtag tccagagaat tattagttct gttaagctta gtacatttct ctgcggtctt 180 

gaaaataaag atgttttaat tttcaatttc ccgatggcca aaccattttg gcatatattg 240 

tcattctttc accgccttct aaaatttaga atagtacctc tgattcatga tattgatgaa 300 

ttaagaggag gagggggtag tgattctgtg cggcttgcta cctgtgatat ggtcataagt 360 

cacaatccac aaatgacaaa gtaccttagt aaatatatgt ctcaggataa aatcaaagac 420 

ataaaaatat ttgattacct cgtctcatct gatgtggagc atcgagatgt tacggataag 480 

caacgagggg tcatatatgc tggcaacctt tctaggcata aatgttcttt catatatact 540 

gaaggatgcg attttactct ctttggtgtc aactatgaaa ataaagataa tcctaaatat 600 

cttggaagtt ttgatgctca atctccggaa aagattaacc tcccaggcat gcaatttgga 660 

ctcatttggg atggagattc tgtcgaaacc tgtagtggtg cctttggcga ctatttaaag 720 

tttaataacc ctcataagac atctctttat ctttcaatgg aacttccagt atttatatgg 780 

gataaagccg cccttgcgga tttcattgta gataatagaa taggatatgc agtgggatca 840 

atcaaagaaa tgcaagagat tgttgactcc atgacaatag aaacttataa gcaaattagt 900 

gagaatacaa aaattatttc tcagaaaatt cgaacaggaa gttacttcag ggatgttctt 960 

gaagaggtga tcgatgatct taaaactcgc taa 993 

<210> 218 
<211> 1167 
<212> DNA 
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<213> E . Coli 



<400> 
atgatctatc 
aaggatatat 
tatgaaataa 
ctttgcaatg 
aatatcagaa 
ggcttgttag 
ttcgggacta 
tcaagaaatt 
tggtctaaaa 
gtattcatct 
tcttatgcat 
gtaggtgttc 
tattatctat 
caagtatcta 
ttaacaggtg 
aatgtttata 
atggttattc 
gtgaaaatat 
ttcatgacta 
tttcttaagg 



218 
ttgtaattag 
tttatccagc 
cgtcagatat 
ttttgacatt 
aagtcaataa 
ttatttcttt 
gcttacttag 
tctcagcata 
aatttactaa 
ttgcaattat 
tcatcgtagg 
tattctcctt 
ccatgtattt 
actctgccag 
gagtctctat 
ctgctttttc 
atggctgtat 
tttattcata 
atattagcag 
cccagaaaat 



tgtctttctc 
cgtatgcgtt 
atatgctttt 
taccctgtca 
tgctatttat 
ttcgatgata 
ctatatgaat 
catgcagcca 
tacaaaggta 
actgaatact 
tgttaataga 
gtatatgctc 
ggtcagccct 
ttctcatgtc 
gtcgttgcat 
ggattatgtt 
ttcaggtgtt 
ttttatttat 
ttggatacaa 
aaagtga 



attacagcat 
aatatcatct 
cagttaaatg 
tgtttattga 
agcataccat 
tatatatgca 
ttgataagag 
atcattctaa 
agtaaaacat 
ggtaagcaaa 
gtaaaacatt 
tttttacgtg 
ataatcgcgt 
ttttggtttt 
aaagaatttg 
tatatttccg 
ttatggagat 
accttttctt 
ataactcttt 



ttatctgttt 
tcgcactggt 
acgctacgtt 
cggaaagtgt 
cgaagaaagt 
tgaggttaag 
atgctgatgt 
ctacttttgc 
ttactttact 
ttgtctttat 
atgtttatct 
gactgcctgg 
ttcaggagtt 
ttgaaaggct 
tgtgggtggg 
cggagctaag 
tgtctcgaaa 
tcatttttta 
gtatcatagt 



atatcttaag 
cttattggga 
gatttttcta 
attagatcta 
gcataatgta 
taactaccag 
tgaagacaca 
tttatttatt 
tgtttttatt 
ggttatcatc 
tattacagct 
ggggatggca 
ttattttcag 
gatggggcta 
tttgccaaca 
ctatttgatg 
ttacatatct 
tcatgaaagc 
attctctcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1167 



<210> 219 
<211> 1104 
<212> DNA 
<213> E. Coli 



<400> 219 

atgtacgatt atatcattgt tggttctggt ttgtttggtg ccgtttgtgc gaatgagtta 60 

aaaaagctaa acaaaaaagt tttagtgatt gagaaaagaa atcatatcgg tggaaatgcg 120 

tacacagagg actgtgaggg tatccagatt cataaatatg gtgcacatat ttttcatacc 180 

aatgataaat atatatggga ttacgttaat gatttagtag aatttaatcg ttttactaat 240 

tctccactgg cgatttataa agacaaatta ttcaaccttc cttttaatat gaatactttc 300 

caccaaatgt ggggagttaa agatcctcaa gaagctcaaa atatcattaa tgctcagaaa 360 

aaaaagtatg gtgacaaggt acctgaaaat ttggaggagc aggcgatttc attagttggg 420 

gaggacttat accaagcatt gataaagggt tatacggaga agcagtgggg aagaagtgca 4 80 

aaagaattgc ctgcatttat tattaagcga atcccagtga gatttacgtt tgataacaat 540 

tatttttccg atcgctatca aggtattccg gtgggaggct acactaagct tattgaaaaa 600 

atgcttgaag gtgtggacgt aaaattaggc attgattttt tgaaagacaa agattctcta 660 

gcgagtaaag cccatagaat catctacact ggacccattg atcagtactt cgactatagg 720 

tttggagcgt tagaatatcg ctctttaaaa tttgagacgg aacgccatga atttccaaac 780 

ttccaaggga atgcagtaat aaatttcact gatgctaatg taccatatac cagaataatt 840 

gagcataaac attttgacta tgttgagaca aagcatacgg ttgttacaaa agaatatcca 900 

ttagagtgga aagttggcga cgaaccctac tatccagtta atgataataa aaacatggag 960 

ctttttaaga aatatagaga gttagctagc agagaagaca aggttatatt tggcgggcgt 1020 

ttggccgagt ataaatatta tgatatgcat caagtgatat ctgccgctct ttatcaagtg 1080 

aaaaatataa tgagtacgga ttaa 1104 

<210> 220 
<211> 1116 
<212> DNA 
<213> E. Coli 



<400> 220 

atgttcccaa aaataatgaa tgatgaaaac tttttcaaaa aagcggcggc gcacggggag 60 

gaacctcctt taactcctca aaacgaacat cagcggtccg ggctgcgctt cgcccgtcgc 120 

gtcagactac cccgtgcggt tggcctggct ggcatgttct taccgattgc ttcaacgctg 180 

gtttcacacc cgccgccggg ctggtggtgg ctggtgttgg tcggctgggc gttcgtctgg 240 

ccgcatttag cctggcagat agcgagcagg gccgtcgatc cgcttagccg ggaaatttac 300 
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aacttaaaaa ccgatgcagt attagcggga atgtgggtag gcgtaatggg cgtaaacgtg 360 

ctgccttcca ccgcgatgtt gatgattatg tgtctgaatt tgatgggggc aggcggcccc 4 20 

cgtctgtttg tcgcgggtct ggtgttgatg gtggtttcct gccttgtcac cctcgagctg 480 

acgggcatta ccgtgtcgtt caatagtgcg ccgctggaat ggtggctctc ccttcccatt 540 

attgtcattt atcctctgct gtttggctgg gtcagctacc agacggcaac caaactggcg 600 

gaacataaac gcaggttgca ggtcatgagt acccgcgacg gcatgacggg cgtgtataac 660 

cgacgtcatt gggaaactat gttacgcaat gaatttgata actgtcggcg gcataatcgc 720 

gatgcaacgt tactgattat cgatatcgac catttcaaga gcatcaacga tacctggggc 780 

catgatgtgg gcgatgaagc gattgtggcg cttacccgac agttacaaat taccctgcgc 840 

ggtagcgatg tgattggtcg gtttggcggc gatgagtttg cagtaatcat gtccggtacg 900 

ccagctgaga gcgccattac cgccatgtta cgggtgcatg aagggctaaa tacattacgt 960 

ttgccgaata cgccacaggt aactttacgg attagtgtgg gggttgcgcc gctgaaccca 1020 

caaatgagtc actatcgtga gtggttgaaa tcggcagatt tggcgcttta caaagcaaag 1080 

aaagccggac gtaaccgcac cgaagtggcg gcctga 1116 

<210> 221 
<211> 1404 
<212> DNA 
<213> E. Coli 



<400> 221 

ttggatgtga acgttgatca gttcgatact gaagctttcc gtactgacaa actggaactg 60 

accagcggca acatcgctga ccataacggt aacgtagtat ctggtgtgtt cgatatccat 120 

agcagcgatt acgttctgaa cgctgatctg gtgaacgacc gtacctggga tacttccaag 180 

tctaactacg gttacggtat tgttgctatg aactctgatg gtcacctgac tatcaacggt 240 

aacggcgacg tagacaacgg tactgaactg gataacagct ctgtagacaa tgttgttgct 300 

gcaaccggta actacaaagt tcgtatcgac aacgcaactg gcgctggcgc tatcgctgat 360 

tacaaagata aagaaattat ctacgtaaac gacgtcaaca gcaacgcgac cttctctgct 420 

gctaacaaag ctgacctggg tgcatacacc tatcaggctg aacagcgcgg taacaccgtt 4 80 

gttctgcaac agatggagct gaccgactac gctaacatgg cgctgagcat cccgtctgcg 540 

aacaccaata tctggaacct ggaacaagac accgttggta ctcgtctgac caactctcgt 600 

catggcctgg ctgataacgg cggcgcatgg gtaagctact tcggtggtaa cttcaacggc 660 

gacaacggca ccatcaacta tgatcaggat gttaacggca tcatggtcgg tgttgatacc 720 

aaaattgacg gtaacaacgc taagtggatc gtcggtgcgg ctgcaggctt cgctaaaggt 780 

gacatgaatg accgttctgg tcaggtggat caagacagcc agactgccta catctactct 840 

tctgctcact tcgcgaacaa cgtctttgtt gatggtagct tgagctactc tcacttcaac 900 

aacgacctgt ctgcaaccat gagcaacggt acttacgttg acggtagcac caactccgac 960 

gcttggggct tcggtttgaa agccggttac gacttcaaac tgggtgatgc tggttacgtg 1020 

actccttacg gcagcgtttc tggtctgttc cagtctggtg atgactacca gctgagcaac 1080 

gacatgaaag ttgacggtca gtcttacgac agcatgcgtt atgaactggg tgtagatgca 1140 

ggttatacct tcacctacag cgaagatcag gctctgactc cgtacttcaa actggcttac 1200 

gtctacgacg actctaacaa cgataacgat gtgaacggcg attccatcga taacggtact 1260 

gaagggtctg cggtacgtgt tggtctgggt actcagttta gcttcaccaa gaacttcagc 1320 

gcctataccg atgctaacta cctcggtggt ggtgacgtag atcaagactg gtccgcgaac 1380 

gtgggtgtta aatatacctg gtaa 1404 



<210> 222 
<211> 669 
<212> DNA 
<213> E. Coli 



<400 
atgcccgtca 
aaacctcttc 
gagtttaata 
gttattttag 
caggctcctt 
ttaatatcag 
attgaacaaa 
ctggaattac 
ttattatcaa 



> 222 
aggatttgac 
aggaatttgg 
atgaaaagca 
agggagttat 
atattatggg 
aaggaaattg 
atcaactctg 
gcgacgtgca 
tgattgactg 



gggcattact 
taagctcgat 
agttatattt 
ctctctgcgt 
gctggctgat 
tacgggatat 
gcgagacgct 
gctcattggg 
gaatgaagaa 



gcaaaggacg 
aaatgtttgt 
tccagtgatg 
agagaagaaa 
ggtttaatga 
catctaccag 
ttttactggt 
cataattcct 
ttgcgatcac 



cgcaaatgtt 
ccagatacgg 
tcaataacga 
acgtacttat 
aaaacgatat 
ccaaacaaac 
tagcctggca 
acgaacaaat 
gtattggtgt 



atctgtagtt 
tacgcgcttc 
agatactttc 
cggtattacc 
accatacaaa 
cattacgctt 
aaatagaatt 
ccgcgcaaca 
gatgaattat 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
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atccatcaac gtacacgcat atcgcgttct gtcgtcgcag aagttctcgc tgctttgcgt 600 
aaaggcggct atatcgaaat gaataaaggc aaactggtcg ctatcaaccg tttgccttca 660 
gagtattaa 669 

<210> 223 
<211> 255 
<212> DNA 
<213> E. Coli 



<400> 223 

atgaccgata aaatccgtac tctgcaaggt cgcgttgtta gcgacaaaat ggagaaatcc 60 

attgttgttg ctatcgaacg ttttgtgaaa cacccgatct acggtaaatt catcaagcgt 120 

acgaccaaac tgcacgtaca tgacgagaac aacgaatgcg gtatcggtga cgtggttgaa 180 

atccgcgaat gccgtccgct gtccaagact aaatcctgga cgctggttcg cgttgtagag 240 

aaagcggttc tgtaa 255 

<210> 224 
<211> 192 
<212> DNA 
<213> E. Coli 



<400> 224 

atgaaagcaa aagagctgcg tgagaagagc gttgaagagc tgaacaccga gctgctgaac 60 

ctgctgcgtg agcagttcaa cctgcgtatg caggctgcaa gtggccagct gcaacagtct 120 

cacctgttga agcaagtgcg tcgcgatgtc gcacgcgtta agactttact gaacgagaag 180 

gcgggtgcgt aa 192 

<210> 225 
<211> 411 
<212> DNA 
<213> E. Coli 



<400> 225 

atgttacaac caaagcgtac aaaattccgt aaaatgcaca aaggccgtaa ccgcggtctg 60 

gcgcagggta cggatgttag cttcggcagc ttcggtctga aagctgttgg ccgtggtcgt 120 

ctgactgccc gtcagatcga agcagcacgt cgtgctatga cccgtgcagt taagcgtcaa 180 

ggtaagatct ggatccgtgt gttcccggac aaaccgatca ctgaaaagcc gctggcagtg 240 

cgtatgggta aaggtaaagg taacgtggag tattgggttg ccttgattca gccgggtaaa 300 

gtcctgtatg aaatggacgg tgttccggaa gagctggccc gtgaagcatt caagctggca 360 

gcagcgaaac tgccgattaa aaccaccttt gtaactaaga cggtgatgta a 411 

<210> 226 

<211> 702 

<212> DNA 

<213> E. Coli 



<400> 226 

atgggtcaga aagtacatcc taatggtatt cgcctgggta ttgtaaaacc atggaactct 60 

acctggtttg cgaacaccaa agaattcgct gacaacctgg acagcgattt taaagtacgt 120 

cagtacctga ctaaggaact ggctaaagcg tccgtatctc gtatcgttat cgagcgtccg 180 

gctaagagca tccgtgtaac cattcacact gctcgcccgg gtatcgttat cggtaaaaaa 240 

ggtgaagacg tagaaaaact gcgtaaggtc gtagcggaca tcgctggcgt tcctgcacag 300 

atcaacatcg ccgaagttcg taagcctgaa ctggacgcaa aactggttgc tgacagcatc 360 

acttctcagc tggaacgtcg cgttatgttc cgtcgtgcta tgaagcgtgc tgtacagaac 420 

gcaatgcgtc tgggcgctaa aggtattaaa gttgaagtta gcggccgtct gggcggcgcg 480 

gaaatcgcac gtaccgaatg gtaccgcgaa ggtcgcgtac cgctgcacac tctgcgtgct 540 

gacatcgact acaacacctc tgaagcgcac accacttacg gtgtaatcgg cgttaaagtg 600 

tggatcttca aaggcgagat cctgggtggt atggctgctg ttgaacaacc ggaaaaaccg 660 

gctgctcagc ctaaaaagca gcagcgtaaa ggccgtaaat aa 702 



<210> 227 
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<211> 333 
<212> DNA 
<213> E. Coli 



<400> 227 

atggaaacta tcgctaaaca tcgccatgct cgttcttctg ctcagaaggt tcgccttgtt 60 

gctgacctga ttcgcggtaa gaaagtgtcg caggctctgg atattttgac ctacaccaac 120 

aagaaagcgg ctgtactggt caagaaagtt ctggaatctg ccattgctaa cgctgaacac 180 

aacgatggcg ctgacattga cgatctgaaa gttacgaaaa ttttcgtaga cgaaggcccg 240 

agcatgaagc gcattatgcc gcgtgcaaaa ggtcgtgcag atcgcatcct gaagcgcacc 300 

agccacatca ctgtggttgt gtccgatcgc tga 333 

<210> 228 

<211> 279 

<212> DNA 

<213> E. Coli 



<400> 228 

atgccacgtt ctctcaagaa aggtcctttt attgacctgc acttgctgaa gaaggtagag 60 

aaagcggtgg aaagcggaga caagaagccc ctgcgcactt ggtcccgtcg ttcaacgatc 120 

tttcctaaca tgatcggttt gaccatcgct gtccataatg gtcgtcagca cgttccggta 180 

tttgtaaccg acgaaatggt tggtcacaaa ctgggtgaat tcgcaccgac tcgtacttat 240 

cgcggccacg ctgctgataa aaaagcgaag aagaaataa 279 

<210> 229 
<211> 822 
<212> DNA 
<213> E. Coli 



<400> 229 

atggcagttg ttaaatgtaa accgacatct ccgggtcgtc gccacgtagt taaagtggtt 60 

aaccctgagc tgcacaaggg caaacctttt gctccgttgc tggaaaaaaa cagcaaatcc 120 

ggtggtcgta acaacaatgg ccgtatcacc actcgtcata tcggtggtgg ccacaagcag 180 

gcttaccgta ttgttgactt caaacgcaac aaagacggta tcccggcagt tgttgaacgt 240 

cttgagtacg atccgaaccg ttccgcgaac atcgcgctgg ttctgtacaa agacggtgaa 300 

cgccgttaca tcctggcccc taaaggcctg aaagctggcg accagattca gtctggcgtt 360 

gatgctgcaa tcaaaccagg taacaccctg ccgatgcgca acatcccggt tggttctact 420 

gttcataacg tagaaatgaa accaggtaaa ggcggtcagc tggcacgttc cgctggtact 480 

tacgttcaga tcgttgctcg tgatggtgct tatgtcaccc tgcgtctgcg ttctggtgaa 54 0 

atgcgtaaag tagaagcaga ctgccgtgca actctgggcg aagttggcaa tgctgagcat 600 

atgctgcgcg ttctgggtaa agcaggtgct gcacgctggc gtggtgttcg tccgaccgtt 660 

cgcggtaccg cgatgaaccc ggtagaccac ccacatggtg gtggtgaagg tcgtaacttt 720 

ggtaagcacc cggtaactcc gtggggcgtt cagaccaaag gtaagaagac ccgcagcaac 780 

aagcgtactg ataaattcat cgtacgtcgc cgtagcaaat aa 822 

<210> 230 
<211> 303 
<212> DNA 
<213> E. Coli 



<400> 230 

atgattcgtg aagaacgtct gctgaaggtg ctgcgtgcac cgcacgtttc tgaaaaagcg 60 

tctactgcga tggaaaaatc caacaccatc gtactcaaag ttgctaaaga cgcgaccaaa 120 

gcagaaatca aagctgctgt gcagaaactg tttgaagtcg aagtcgaagt cgttaacacc 180 

ctggtagtta aagggaaagt taaacgtcac ggacagcgta tcggtcgtcg tagcgactgg 240 

aaaaaagctt acgtcaccct gaaagaaggc cagaatctgg acttcgttgg cggcgctgag 300 



<210> 231 
<211> 630 
<212> DNA 



WO 00/44906 



PCT/USOO/02200 



<213> E. Coli 



<400> 
atgattggtt 
tctatcccag 
gctaacgatg 
accaagcctg 
gaattccgcc 
tttgctgacg 
accgttaagc 
cgcgttccgg 
atggcaggtc 
gacgctgagc 
ctgatcgtta 



231 
tagtcggtaa 
taaccgtaat 
gctaccgtgc 
aagctggcca 
tggctgaagg 
ttaaaaaagt 
gctggaactt 
gttctatcgg 
agatgggtaa 
gcaacctgct 
aaccagctgt 



aaaagtgggt 
cgaagttgaa 
tattcaggtg 
cttcgctaaa 
cgaagagttc 
tgacgtaact 
ccgtacccag 
tcagaaccag 
cgaacgtgta 
gctggttaaa 
gaaggcgtaa 



atgacccgta 
gcaaaccgcg 
accaccggtg 
gctggcgtag 
actgtaggtc 
ggcacctcta 
gacgctactc 
actccgggca 
accgttcaga 
ggtgctgtcc 



tcttcacaga 


agacggcgtt 


60 


ttactcaggt 


taaagacctg 


120 


ctaaaaaagc 


taaccgtgtg 


180 


aagctggccg 


tggtctgtgg 


240 


agagcattag 


cgttgaactg 


300 


aaggtaaagg 


tttcgcaggt 


360 


acggtaactc 


cttgtctcac 


420 


aagtgttcaa 


aggcaagaaa 


480 


gccttgacgt 


agtacgcgtt 


540 


cgggtgcaac 


cggtagcgac 


600 






630 



<210> 232 
<211> 606 
<212> DNA 
<213> E. Coli 



<400 
atggaattag 
cgtgatttca 
cagggtactc 
cgccagaaag 
ggtggcgtga 
taccgcggcg 
gtcgagaagt 
atggctctgg 
gcgcgcaacc 
atcgccttcg 
gcatga 



> 232 
tattgaaaga 
acgaagcgct 
gtgctcagaa 
gcaccggccg 
cctttgctgc 
cgctgaaaag 
tctctgtaga 
aagatgtgct 
tgcacaaggt 
acaaagtcgt 



<210> 233 
<211> 312 
<212> DNA 
<213> E. Coli 



cgcgcagagc 
ggttcaccag 
gactcgtgct 
tgcgcgttct 
tcgtccgcag 
catcctgtcc 
agcgccgaaa 
gatcatcacc 
tgacgtacgc 
aatgactgct 



gcgctgactg 
gttgttgttg 
gaagtaactg 
ggttctatca 
gaccacagtc 
gaactggtac 
actaagctgc 
ggtgagctgg 
gatgcaactg 
gatgctgtta 



tttccgaaac 


taccttcggt 


60 


cttatgcagc 


tggtgctcgt 


120 


gttccggtaa 


aaaaccgtgg 


180 


agagcccgat 


ctggcgttct 


240 


aaaaagttaa 


caagaagatg 


300 


gtcaggatcg 


tctgatcgtt 


360 


tggcacagaa 


actgaaagac 


420 


acgaaaacct 


gttcctggct 


480 


gtatcgaccc 


ggttagcctg 


540 


agcaagttga 


ggagatgctg 


600 






606 



<400> 233 

atgcagaacc aaagaatccg tatccgcctg aaagcgtttg 

gcaaccgcgg aaatcgtcga gactgccaag cgcactggtg 

ccgctgccga cacgcaaaga gcgcttcact gttctgatct 

gcgcgcgatc agtacgaaat ccgtactcac ttgcgtctgg 

gagaaaaccg ttgatgctct gatgcgtctg gatctggctg 
agcctgggtt aa 



atcatcgtct 


gatcgatcaa 


60 


cgcaggtccg 


tggtccgatc 


120 


ccccgcacgt 


caacaaagac 


180 


ttgacatcgt 


tgagccaacc 


240 


ccggtgtaga 


cgtgcagatc 


300 






312 



<210> 234 
<211> 357 
<212> DNA 
<213> E. Coli 



<400 
atggctcgcg 
gctaaaggct 
aaagctggtc 
tggattgcgc 
aatggcctga 
ttcgacaaag 



> 234 
taaaacgtgg 
actacggtgc 
agtatgctta 
gtatcaacgc 
aaaaagcctc 
tagcgttcac 



tgttattgca 
gcgttctcgc 
ccgtgaccgt 
agcagcacgt 
tgttgaaatc 
cgctctggtt 



cgtgcacgtc 
gtataccgcg 
cgtcaacgta 
cagaacggta 
gaccgtaaga 
gaaaaagcga 



acaagaaaat 


tttgaaacaa 


60 


ttgccttcca 


ggctgttatc 


120 


agcgtcagtt 


ccgtcaactg 


180 


tttcttacag 


caaattcatc 


240 


tcctggctga 


tatcgcagta 


300 


aagcagctct 


ggcataa 


357 



<210> 235 
<211> 198 



-79- 



WO 00/44906 



PCT/US00/02200 



<212> DNA 
<213> E. Coli 

<400> 235 

atgccaaaaa ttaagaccgt acgcggtgct gctaagcgct tcaaaaaaac cggtaaaggt 60 

ggttttaagc acaagcacgc taacctgcgt cacattctga ccaaaaaagc gaccaaacgt 120 

aaacgtcacc tgcgtccgaa agccatggtt tccaaaggcg atctgggcct ggtaatcgcg 180 
tgcctgccgt acgcataa " " " 

<210> 236 
<211> 543 
<212> DNA 
<213> E. Coli 

<400> 236 

attaaaggcg gaaaacgagt tcaaacggcg cgccctaacc gtatcaatgg cgaaattcgc 60 

gcccaggaag ttcgcttaac aggtctggaa ggcgagcagc ttggtattgt gagtctgaga 120 

gaagctctgg agaaagcaga agaagccgga gtagacttag tcgagatcag ccctaacgcc 180 

gagccgccgg tttgtcgtat aatggattac ggcaaattcc tctatgaaaa gagcaagtct 240 

tctaaggaac agaagaaaaa gcaaaaagtt atccaggtta aggaaattaa attccgtcct 300 

ggtacagatg aaggcgacta tcaggtaaaa ctccgcagcc tgattcgctt tctcgaagag 360 

ggtgataaag ccaaaatcac gctgcgtttc cgcggtcgtg agatggcgca ccagcaaatc 420 

ggtatggaag tgcttaatcg cgtgaaagac gatttgcaag aactggcagt ggtcgaatcc 480 

ttcccaacga agatcgaagg ccgccagatg atcatggtgc tcgctcctaa gaagaaacag 540 



<210> 237 
<211> 1929 
<212> DNA 
<213> E. Coli 

<400> 237 

atgcctgtta taactcttcc tgatggcagc caacgccatt acgatcacgc tgtaagcccc 60 

atggatgttg cgctggacat tggtccaggt ctggcgaaag cctgtatcgc agggcgcgtt 120 

aatggcgaac tggttgatgc ttgcgatctg attgaaaacg acgcacaact gtcgatcatt 180 

accgccaaag acgaagaagg tctggagatc attcgtcact cctgtgcgca cctgttaggg 240 

cacgcgatta aacaactttg gccgcatacc aaaatggcaa tcggcccggt tattgacaac 300 

ggtttttatt acgacgttga tcttgaccgc acgttaaccc aggaagatgt cgaagcactc 360 

gagaagcgga tgcatgagct tgctgagaaa aactacgacg tcattaagaa gaaagtcagc 420 

tggcacgaag cgcgtgaaac tttcgccaac cgtggggaga gctacaaagt ctccattctt 4 80 

gacgaaaaca tcgcccatga tgacaagcca ggtctgtact tccatgaaga atatgtcgat 540 

atgtgccgcg gtccgcacgt accgaacatg cgtttctgcc atcatttcaa actaatgaaa 600 

acggcagggg cttactggcg tggcgacagc aacaacaaaa tgttgcaacg tatttacggt 660 

acggcgtggg cagacaaaaa agcacttaac gcttacctgc agcgcctgga agaagccgcg 720 

aaacgcgacc accgtaaaat cggtaaacag ctcgacctgt accatatgca ggaagaagcg 780 

ccgggtatgg tattctggca caacgacggc tggaccatct tccgtgaact ggaagtgttt 840 

gttcgttcta aactgaaaga gtaccagtat caggaagtta aaggtccgtt catgatggac 900 

cgtgtcctgt gggaaaaaac cggtcactgg gacaactaca aagatgcaat gttcaccaca 960 

tcttctgaga accgtgaata ctgcattaag ccgatgaact gcccgggtca cgtacaaatt 1020 

ttcaaccagg ggctgaagtc ttatcgcgat ctgccgctgc gtatggccga gtttggtagc 1080 

tgccaccgta acgagccgtc aggttcgctg catggcctga tgcgcgtgcg tggatttacc 1140 

caggatgacg cgcatatctt ctgtactgaa gaacaaattc gcgatgaagt taacggatgt 1200 

atccgtttag tctatgatat gtacagcact tttggcttcg agaagatcgt cgtcaaactc 1260 

tccactcgtc ctgaaaaacg tattggcagc gacgaaatgt gggatcgtgc tgaggcggac 1320 

ctggcggttg cgctggaaga aaacaacatc ccgtttgaat atcaactggg tgaaggcgct 1380 

ttctacggtc cgaaaattga atttaccctg tatgactgcc tcgatcgtgc atggcagtgc 1440 

ggtacagtac agctggactt ctctttgccg tctcgtctga gcgcttctta tgtaggcgaa 1500 

gacaatgaac gtaaagtacc ggtaatgatt caccgcgcaa ttctggggtc gatggaacgt 1560 

ttcatcggta tcctgaccga agagttcgct ggtttcttcc cgacctggct tgcgccggtt 1620 

caggttgtta tcatgaatat taccgattca cagtctgaat acgttaacga attgacgcaa 1680 

aaactatcaa atgcgggcat tcgtgttaaa gcagacttga gaaatgagaa gattggcttt 1740 
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aaaatccgcg agcacacttt gcgtcgcgtc ccatatatgc tggtctgtgg tgataaagag 1800 

gtggaatcag gcaaagttgc cgttcgcacc cgccgtggta aagacctggg aagcatggac 1860 

gtaaatgaag tgatcgagaa gctgcaacaa gagattcgca gccgcagtct taaacaattg 1920 

gaggaataa 1929 



<210> 238 
<211> 1353 
<212> DNA 
<213> E. Coli 

<400> 238 

atgactaaac actatgatta catcgccatc ggcggcggca gcggcggtat cgcctccatc 60 

aaccgcgcgg ctatgtacgg ccagaaatgt gcgctgattg aagccaaaga gctgggcggc 120 

acctgcgtaa atgttggctg tgtgccgaaa aaagtgatgt ggcacgcggc gcaaatccgt 180 

gaagcgatcc atatgtacgg cccggattat ggttttgata ccactatcaa taaattcaac 240 

tgggaaacgt tgatcgccag ccgtaccgcc tatatcgacc gtattcatac ttcctatgaa 300 

aacgtgctcg gtaaaaataa cgttgatgta atcaaaggct ttgcccgctt cgttgatgcc 360 

aaaacgctgg aggtaaacgg cgaaaccatc acggccgatc atattctgat cgccacaggc 420 

ggtcgtccga gccacccgga tattccgggc gtggaatacg gtattgattc tgatggcttc 480 

ttcgcccttc ctgctttgcc agagcgcgtg gcggttgttg gcgcgggtta catcgccgtt 54 0 

gagctggcgg gcgtgattaa cggcctcggc gcgaaaacgc atctgtttgt gcgtaaacat 600 

gcgccgctgc gcagcttcga cccgatgatt tccgaaacgc tggtcgaagt gatgaacgcc 660 

gaaggcccgc agctgcacac caacgccatc ccgaaagcgg tagtgaaaaa taccgatggt 720 

agcctgacgc tggagctgga agatggtcgc agtgaaacgg tggattgcct gatttgggcg 780 

attggtcgcg agcctgccaa tgacaacatc aacctggaag ccgctggcgt taaaactaac 840 

gaaaaaggct atatcgtcgt cgataaatat caaaacacca atattgaagg tatttacgcg 900 

gtgggcgata acacgggtgc agtggagctg acaccggtgg cagttgcagc gggtcgccgt 960 

ctctctgaac gcctgtttaa taacaagccg gatgagcatc tggattacag caacattccg 1020 

accgtggtct tcagccatcc gccgattggt actgttggtt taacggaacc gcaggcgcgc 1080 

gagcagtatg gcgacgatca ggtgaaagtg tataaatcct ctttcaccgc gatgtatacc 114 0 

gccgtcacca ctcaccgcca gccgtgccgc atgaagctgg tgtgcgttgg atcggaagag 1200 

aagattgtcg gtattcacgg cattggcttt ggtatggacg aaatgttgca gggcttcgcg 1260 

gtggcgctga agatgggggc aaccaaaaaa gacttcgaca ataccgtcgc cattcaccca 1320 

acggcggcag aagagttcgt gacaatgcgt taa 1353 

<210> 239 
<211> 2904 
<212> DNA 
<213> E. Coli 

<400> 239 

aaggttaagc ctcacggttc attagtaccg gttagctcaa cgcatcgctg cgcttacaca 60 

cccggcctat caacgtcgtc gtcttcaacg ttccttcagg acccttaaag ggtcagggag 120 

aactcatctc ggggcaagtt tcgtgcttag atgctttcag cacttatctc ttccgcattt 180 

agctaccggg cagtgccatt ggcatgacaa cccgaacacc agtgatgcgt ccactccggt 240 

cctctcgtac taggagcagc ccccctcagt tctccagcgc ccacggcaga tagggaccga 300 

actgtctcac gacgttctaa acccagctcg cgtaccactt taaatggcga acagccatac 360 

ccttgggacc tacttcagcc ccaggatgtg atgagccgac atcgaggtgc caaacaccgc 420 

cgtcgatatg aactcttggg cggtatcagc ctgttatccc cggagtacct tttatccgtt 480 

gagcgatggc ccttccattc agaaccaccg gatcactatg acctgctttc gcacctgctc 540 

gcgccgtcac gctcgcagtc aagctggctt atgccattgc actaacctcc tgatgtccga 600 

ccaggattag ccaaccttcg tgctcctccg ttactcttta ggaggagacc gccccagtca 660 

aactacccac cagacactgt ccgcaacccg gattacgggt caacgttaga acatcaaaca 720 

ttaaagggtg gtatttcaag gtcggctcca tgcagactgg cgtccacact tcaaagcctc 780 

ccacctatcc tacacatcaa ggctcaatgt tcagtgtcaa gctatagtaa aggttcacgg 840 

ggtctttccg tcttgccgcg ggtacactgc atcttcacag cgagttcaat ttcactgagt 900 

ctcgggtgga gacagcctgg ccatcattac gccattcgtg caggtcggaa cttacccgac 960 

aaggaatttc gctaccttag gaccgttata gttacggccg ccgtttaccg gggcttcgat 1020 

caagagcttc gcttgcgcta accccatcaa ttaaccttcc ggcaccgggc aggcgtcaca 1080 

ccgtatacgt ccactttcgt gtttgcacag tgctgtgttt ttaataaaca gttgcagcca 1140 

gctggtatct tcgactgatt tcagctccat ccgcgaggga cctcacctac atatcagcgt 1200 
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gccttctccc gaagttacgg caccattttg cctagttcct tcacccgagt tctctcaagc 1260 

gccttggtat tctctacctg accacctgtg tcggtttggg gtacgatttg atgttacctg 1320 

atgcttagag gcttttcctg gaagcagggc atttgttgct tcagcaccgt agtgcctcgt 1380 

catcacgcct cagccttgat tttccggatt tgcctggaaa accagcctac acgcttaaac 1440 

cgggacaacc gtcgcccggc caacatagcc ttctccgtcc ccccttcgca gtaacaccaa 1500 

gtacaggaat attaacctgt ttcccatcga ctacgccttt cggcctcgcc ttaggggtcg 1560 

actcaccctg ccccgattaa cgttggacag gaacccttgg tcttccggcg agcgggcttt 1620 

tcacccgctt tatcgttact tatgtcagca ttcgcacttc tgatacctcc agcatgcctc 1680 

acagcacacc ttcgcaggct tacagaacgc tcccctaccc aacaacgcat aagcgtcgct 1740 

gccgcagctt cggtgcatgg tttagccccg ttacatcttc cgcgcaggcc gactcgacca 1800 

gtgagctatt acgctttctt taaatgatgg ctgcttctaa gccaacatcc tggctgtctg 1860 

ggccttccca catcgtttcc cacttaacca tgactttggg accttagctg gcggtctggg 1920 

ttgtttccct cttcacgacg gacgttagca cccgccgtgt gtctcccgtg ataacattct 1980 

ccggtattcg cagtttgcat cgggttggta agtcgggatg acccccttgc cgaaacagtg 2040 

ctctaccccc ggagatgaat tcacgaggcg ctacctaaat agctttcggg gagaaccagc 2100 

tatctcccgg tttgattggc ctttcacccc cagccacaag tcatccgcta atttttcaac 2160 

attagtcggt tcggtcctcc agttagtgtt acccaacctt caacctgccc atggctagat 2220 

caccgggttt cgggtctata ccctgcaact taacgcccag ttaagactcg gtttcccttc 2280 

ggctccccta ttcggttaac cttgctacag aatataagtc gctgacccat tatacaaaag 2340 

gtacgcagtc acacgcctaa gcgtgctccc actgcttgta cgtacacggt ttcaggttct 24 00 

ttttcactcc cctcgccggg gttcttttcg cctttccctc acggtactgg ttcactatcg 24 60 

gtcagtcagg agtatttagc cttggaggat ggtcccccca tattcagaca ggataccacg 2520 

tgtcccgccc tactcatcga gctcacagca tgtgcatttt tgtgtacggg gctgtcaccc 2580 

tgtatcgcgc gcctttccag acgcttccac taacacacac actgattcag gctctgggct 2640 

gctccccgtt cgctcgccgc tactggggga atctcggttg atttcttttc ctcggggtac 2700 

ttagatgttt cagttccccc ggttcgcctc attaacctat ggattcagtt aatgatagtg 2760 

tgtcgaaaca cactgggttt ccccattcgg aaatcgccgg ttataacggt tcatatcacc 2820 

ttaccgacgc ttatcgcaga ttagcacgtc cttcatcgcc tctgactgcc agggcatcca 2880 

ccgtgtacgc ttagtcgctt aacc ~ 2904 

<210> 240 
<211> 120 
<212> DNA 
<213> E. Coli 



<400> 240 

atgcctggca gttccctact ctcgcatggg gagaccccac actaccatcg gcgctacggc 60 
gtttcacttc tgagttcggc atggggtcag gtgggaccac cgcgctacgg ccgccaggca 120 

<210> 241 
<211> 76 
<212> DNA 
<213> E . Coli 

<400> 241 

gtccccttcg tctagaggcc caggacaccg ccctttcacg gcggtaacag gggttcgaat 60 
cccctagggg acgcca 7g 

<210> 242 

<211> 1549 

<212> DNA 

<213> E. Coli 



<400> 242 

aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 60 

gtcgaacggt aacaggaagc agcttgctgc ttcgctgacg agtggcggac gggtgagtaa 120 

tgtctgggaa gctgcctgat ggagggggat aactactgga aacggtagct aataccgcat 180 

aatgtcgcaa gaccaaagag ggggaccttc gggcctcttg ccatcggatg tgcccagatg 240 

ggattagctt gttggtgggg taacggctca ccaaggcgac gatccctagc tggtctgaga 300 

ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 360 

ggaatattgc acaatgggcg caagcctgat gcagccatgc cgcgtgtatg aagaaggcct 420 
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tcgggttgta aagtactttc agcggggagg aagggagtaa agttaatacc tttgctcatt 4 80 

gacgttaccc gcagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 540 

ggtgcaagcg ttaatcggaa ttactgggcg taaagcgcac gcaggcgggt tggttaagtc 600 

agatgtgaaa tccccgggct caacctggga actgcatctg atactggcaa gcttgagtct 660 

cgtagagggg ggtagaattc caggtgtagc ggtgaaatgc gtagagatct ggaggaatac 720 

cggtggcgaa ggcggccccc tggacgaaga ctgacgctca ggtgcgaaag cgtggggagc 780 

aaacaggatt agataccctg gtagtccacg ccgtaaacga tgtcgacttg gaggttgtgc 840 

ccttgaggcg tggcttccgg agctaacgcg ttaagtcgac cgcctgggga gtacggccgc 900 

aaggttaaaa ctcaaatgaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa 960 

ttcgatgcaa cgcgaagaac cttacctggt cttgacatcc acggaagttt tcagagatga 1020 

gaatgtgcct tcgggaaccg tgagacaggt gctgcatggc tgtcgtcagc tcgtgttgtg 1080 

aaatgttggg ttaagtcccg caacgagcgc aacccttatc ctttgttgcc agcggtccgg 1140 

ccgggaactc aaaggagact gccagtgata aactggagga aggtggggat gacgtcaagt 1200 

catcatggcc cttacgacca gggctacaca cgtgctacaa tggcgcatac aaagagaagc 1260 

gacctcgcga gagcaagcgg acctcataaa gtgcgtcgta gtccggattg gagtctgcaa 1320 

ctcgactcca tgaagtcgga atcgctagta atcgtggatc agaatgccac ggtgaatacg 1380 

ttcccgggcc ttgtacacac cgcccgtcac accatgggag tgggttgcaa aagaagtagg 1440 

tagcttaacc ttcgggaggg cgcttaccac tttgtgattc atgactgggg tgaagtcgta 1500 

acaaggtaac cgtaggggaa cctgcggttg gatcacctcc ttaccttaa 154 9 



<210> 243 

<211> 221 

<212> PRT 

<213> E. Coli 



<400> 243 



Met 


Asn 


Val 


Phe 


Ser 


Gin 


Thr 


Gin 


1 








5 








Ser 


Leu 


Phe 


His 


Leu 


Leu 


Val 


He 








20 










Leu 


Pro 


Val 


Ser 


He 


Leu 


Gly 


Phe 






35 










40 


Phe 


Pro 


Phe 


He 


Phe 


Leu 


Ala 


Thr 




50 










55 




Ala 


Pro 


Leu 


Ala 


Arg 


Arg 


He 


He 


65 










70 






Leu 


He 


Ser 


Tyr 


Val 


He 


Ser 


Ser 










85 








Gly 


Phe 


Gly 


Ala 


Leu 


Ala 


His 


Phe 








100 










Thr 


Ala 


Ser 


Phe 


Met 


Ala 


Tyr 


Ala 






115 










120 


Val 


Phe 


Asn 


Arg 


Leu 


Arg 


Gin 


Ser 




130 










135 




Ala 


Ser 


Thr 


Leu 


Phe 


Gly 


Asn 


Val 


145 










150 






lie 


Ala 


Phe 


Trp 


Arg 


Ser 


Pro 


Asp 










165 








Glu 


He 


Ala 


Leu 


Val 


Asp 


Tyr 


Cys 








180 










Phe 


Phe 


Leu 


Pro 


Met 


Tyr 


Gly 


Val 






195 










200 


Leu 


Ala 


Asp 


Lys 


Ser 


Glu 


He 


Asn 




210 










215 





Arg 


Tyr 


Lys 


Ala 


Leu 


Phe 


Trp Leu 




10 










15 




Thr 


Ser 


Ser 


Asn 


Tyr 


Leu 


Val 


Gin 


25 










30 






His 


Thr 


Thr 


Trp 


Gly 
45 


Ala 


Phe 


Ser 


Asp 


Leu 


Thr 


Val 
60 


Arg 


He 


Phe 


Gly 


Phe 


Ala 


Val 
75 


Met 


lie 


Pro 


Ala 


Leu 
80 


Leu 


Phe 
90 


Tyr 


Met 


Gly 


Ser 


Trp 

95 


Gin 


Asn 


Leu 


Phe 


Val 


Ala 


Arg 


He 


Ala 


105 










110 






Leu 


Gly 


Gin 


He 


Leu 


Asp Val 


His 










125 








Arg 


Arg 


Trp 


Trp 
140 


Leu 


Ala 


Pro 


Thr 


Ser 


Asp Thr 


Leu 


Ala 


Phe 


Phe 


Phe 






155 










160 


Ala 


Phe 
170 


Met 


Ala 


Glu 


His 


Trp 
175 


Met 


Phe 


Lys 


Val 


Leu 


He 


Ser 


He 


Val 


185 










190 






Leu 


Leu 


Asn 


Met 


Leu 
205 


Leu 


Lys 


Arg 


Ala 


Leu 


Gin 


Ala 


Ser 









220 



<210> 244 
<211> 203 
<212> PRT 
<213> E. Coli 
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<400> 


244 


















Met He Arg 


Trp 


Met 


Asn 


Glu 


Pro 


Leu 


Trp 


Pro Phe He 


Glu Arg Lys 


1 




5 










10 




Id 


Lys Ser Met Arg 


Asn 


Leu 


Val 


Lys 


Tyr 


Val 


Gly He Gly 


Leu Leu Val 




20 










25 






Met Gly Leu Ala 
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Cys Asp Asp Lys Asp Thr Asn Ala 


inr Ala Gin 


35 










40 






45 




Glv Sf»r V^l 


ril a 


Glu 


Ser 


Asn 


Al a 


Thr 


Gly Asn Pro Val 


Asn Leu Leu 


SO 








55 
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Le u 


Ser 


Phe 


Ser 


Leu 


Pro 


Ala 


Asp Met Thr 


Asp Gin Ser 


65 






70 










75 


80 


uiy Lys ijtru 


ui y 


Thr 


Gin 


Ala 


Asn 


Asn 


Met 


His Val Trp 


Ser Asp Ala 






85 










90 




95 


Thr Gly Gin 


Lys 


Ala 


Val 


He 


Val 


He 


Met 


Gly Asp Asp 


Pro Lys Glu 




100 










105 






110 


Asp Leu Ala 


Val 


Leu 


Ala 


Lys 


Arg 


Leu 


Glu 


Asp Gin Gin Arg Ser Arg 


115 










120 






125 




Asp Pro Gin 


Leu 


Gin 


Val 


Val 


Thr 


Asn 


Lys 


Ala He Glu 


Leu Lys Gly 


130 








135 








140 


His Lys Met 


Gin 


Gin 


Leu Asp 


Ser 


He 


He 


Ser Ala Lys 


Gly Gin Thr 


145 






150 










155 


160 


Ala Tyr Ser 


Ser 


Val 


He 


Leu Gly Asn Val Gly Asn Gin Leu Leu Thr 






165 










170 




175 


Met Gin He 


Thr 


Leu 


Pro Ala Asp Asp Gin Gin Lys Ala 


Gin Thr Thr 




180 










185 






190 


Ala Glu Asn 


He 


He 


Asn 


Thr 


Leu 


Val 


He 


Gin 





195 200 



<210> 245 
<211> 324 
<212> PRT 
<213> E. Coli 





<400> 


245 


















Met 


Ala 


Asn 


Met 


Phe 


Ala 


Leu 


He 


Leu 


Val He Ala 


Thr 


Leu Val Thr 


1 








5 










10 




15 


Gly 


He 


Leu 


Trp 


Cys 


Val Asp Lys 


Phe 


Phe Phe Ala 


Pro 


Lys Arg Arg 








20 










25 






30 


Glu Arg 


Gin 


Ala 


Ala 


Ala 


Gin 


Ala 


Ala 


Ala Gly Asp Ser Leu Asp Lys 






35 










40 






45 




Ala 


Thr 


Leu 


Lys 


Lys 


Val 


Ala 


Pro 


Lys 


Pro Gly Trp 


Leu 


Glu Thr Gly 




50 










55 
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Val 


Phe 
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Val 


Leu 


Ala 


He 


Val Leu He 


Val 


Arg Ser Phe 


65 










70 
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80 


He 


Tyr 


Glu 


Pro 


Phe 


Gin 


He 


Pro 


Ser 


Gly Ser Met 


Met 


Pro Thr Leu 










85 










90 




95 


Leu 


He 


Gly Asp 


Phe 


He 


Leu 


Val 


Glu 


Lys Phe Ala 


Tyr Gly He Lys 








100 










105 
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Asp 


Pro 


lie 


Tyr 


Gin 


Lys 


Thr 


Leu 


He 


Glu Thr Gly His 


Pro Lys Arg 






115 










120 






125 


Gly Asp 


He 


Val 


Val 


Phe 


Lys 


Tyr 


Pro 


Glu Asp Pro 


Lys 


Leu Asp Tyr 




130 










135 






140 
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Lys 
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Val 
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Pro 
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Thr 


Tyr Asp Pro 


145 










150 
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Val 
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Thr 


He 


Gin 


Pro 
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Glu 


Pro Ser Asp 
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Arg 


Asn 


Gly Gly Glu 
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Thr Ser Gly 
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195 
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Phe 


Phe 


Glu 


Val 


Pro 
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Asn 


Glu 




210 
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Ser 


Glu 


Arg 
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Glu 
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Leu 


Gly 
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Val 
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Ala 
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Gin 


Val 
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Val 
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Arg Asp 
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Ala 
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Glu 


Ala 
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Val 


Gly Arg 
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Gin 


Glu 


Gly 


Glu 


Trp 


Pro 
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Gly Gly He His 
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Glu 


Asn Gly He 
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Val 
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His Arg He 


Leu 


Thr 
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Gly 


Met 
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He 
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320 



<210> 246 
<211> 586 
<212> PRT 
<213> E. Coli 



<400> 246 
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Thr 


He 
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Glu 


Glu 
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Leu 


He 
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Leu 
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Gin 


Ser 


Asp 
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Gly 
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Leu 


Val 


Gly 


Gly 


His 


Tyr Glu Val 
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His 


Ser 


He 


Arg 


Leu 


Arg 


His 


Ala 


Val 


Ser Ala Asp Asp Asn 


Phe 
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Thr 


Leu 


Thr 
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Val 


Val 
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Val 
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120 










125 
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Leu 


Arg 
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Phe 
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Gly 
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He 
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Leu 


Gin 


Pro 


Gly 
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135 










140 
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He 
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He 
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145 










150 










155 










160 


Leu 


Leu 


Ala 


Gin 


Pro 


Leu 


Leu 
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He 


Val 


Asn 










165 










170 










175 




Arg 


Glu 


Arg 


Phe 


Asp 


Trp 


Val 


Ala 


Phe 


Asp 


Glu 


Ser Arg 


Pro 


Leu 


Pro 








180 










185 










190 






Val 


Ser 


Val 


Pro 


Ser 


Met 


Pro 


Leu 


Lys 


Leu 


Lys 


Val 


He 


Leu 


Val 
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He 
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Leu 


Ser 


Ala 


Lys Ser Trp 


Thr 


Val 


Met 


65 








70 










75 






80 


Arg 


Phe 


Ser 


Glu Thr 


Glu 


Gin 


Asn 


Gly 


Lys 


Leu Glu Trp 


Leu 


He 


Arg 








85 










90 






95 


Lys 


Asp 


Gly Cys Met 


His 


Cys 


Glu 


Asp 


Pro 


Gly Cys Leu 


Lys 


Ala 


Cys 








100 








105 






110 




Pro 


Ser 


Ala 


Gly Ala 


He 


He 


Gin 


Tyr Ala 


Asn Gly He 


Val 


Asp 


Phe 






115 








120 






125 






Gin 


Ser 


Glu 


Asn Cys 


He 


Gly 


Cys 


Gly Tyr 


Cys He Ala 


Gly Cys 


Pro 




130 








135 








140 








Phe 


Asn 


He 


Pro Arg 


Leu 


Asn 


Lys 


Glu Asp Asn Arg Val 


Tyr 


Lys 


Cys 


145 








150 










155 




160 


Thr 


Leu 


Cys 


Val Asp 
165 


Arg 


Val 


Ser 


Val 


Gly 
170 


Gin Glu Pro 


Ala 


Cys 
175 


Val 


Lys 


Thr 


Cys 


Pro Thr Gly Ala 


He 


His 


Phe 


Gly Thr Lys 


Lys 


Glu 


Met 








180 








185 






190 






Leu 


Glu 


Leu 


Ala Glu 


Gin Arg Val 


Ala 


Lys 


Leu Lys Ala 


Arg 


Gly 


Tyr 






195 








200 






205 




Glu 


His 


Ala 


Gly Val 


Tyr Asn 


Pro 


Glu 


Gly Val Gly Gly 


Thr 


His 


Val 




210 








215 








220 








Met 


Tyr 


Val 


Leu His 


His 


Ala 


Asp 


Gin 


Pro 


Glu Leu Tyr 


His 


Gly 


Leu 


225 








230 










235 




240 


Pro 


Lys 


Asp 


Pro Lys 


He 


Asp 


Thr 


Ser 


Val 


Ser Leu Trp 


Lys 


Gly Ala 








245 










250 






255 




Leu 


Lys 


Pro 


Leu Ala 


Ala 


Ala 


Gly 


Phe 


He 


Ala Thr Phe 


Ala 


Gly Leu 
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260 265 270 

lie Phe His Tyr lie Gly lie Gly Pro Asn Lys Glu Val Asp Asp Asp 

275 280 285 

Glu Glu Asp His His Glu 
290 
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Met 


Ser 


Lys 


Ser 


Lys 


Met 


He 


Val 


Arg Thr Lys 


Phe 


He Asp Arg Ala 


1 






5 








10 




15 


Cys 


His 


Trp Thr 


Val 


Val 


He 


Cys 


Phe Phe Leu 


Val 


Ala Leu Ser Gly 






20 










25 




30 


lie 


Ser 


Phe 


Phe 


Phe 


Pro 


Thr 


Leu 


Gin Trp Leu 


Thr 


Gin Thr Phe Gly 






35 










40 






45 


Thr 


Pro 


Gin 


Met 


Gly 


Arg 


He 


Leu 


His Pro Phe 


Phe 


Gly He Ala He 




50 










55 






60 




Phe 


Val 


Ala 


Leu 


Met 


Phe 


Met 


Phe 


Val Arg Phe 


Val 


His His Asn He 


65 










70 






75 




80 


Pro 


Asp 


Lys 


Lys 


Asp 


He 


Pro 


Trp 


Leu Leu Asn 


He 


Val Glu Val Leu 








85 








90 




95 


Lys 


Gly 


Asn 


Glu 


His 


Lys 


Val 


Ala 


Asp Val Gly Lys 


Tyr Asn Ala Gly 




100 










105 




110 


Gin 


Lys 


Met 


Met 


Phe 


Trp 


Ser 


He 


Met Ser Met 


He 


Phe Val Leu Leu 




115 










120 






125 


Val 


Thr 


Gly Val 


He 


He 


Trp Arg 


Pro Tyr Phe 


Ala 


Gin Tyr Phe Pro 




130 










135 






140 




Met 


Gin 


Val 


Val 


Arg 


Tyr 


Ser 


Leu 


Leu lie His 


Ala 


Ala Ala Gly He 


145 










150 






155 




160 


lie 


Leu 


He 


His 


Ala 


He 


Leu 


He 


His Met Tyr 


Met 


Ala Phe Trp Val 










165 








170 




175 


Lys 


Gly 


Ser 


He 


Lys 


Gly 


Met 


He 


Glu Gly Lys 


Val 


Ser Arg Arg Trp 




180 










185 




190 


Ala 


Lys 


Lys 


His 


His 


Pro 


Arg Trp Tyr Arg Glu 


He 


Glu Lys Ala Glu 




195 










200 






205 


Ala 


Lys 


Lys 


Glu 


Ser 


Glu 


Glu Gly 


He 








210 










215 
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Met 


Ala Leu 


Leu 


He 


Thr 


Lys 


Lys 


1 






5 








Pro 


Glu Cys 


Pro 


Asn 


Glu 


Ala 


He 






20 










He 


Asn Ser 


Asp 


Lys 


Cys 


Thr 


Glu 




35 










40 


Thr 


Cys Gin 


Lys 


Val 


Cys 


Pro 


He 




50 








55 




Ala 


His Val 


Glu 


Thr 


Glu 


Glu 


Gin 


65 








70 






His 


His Ala Asp 


Lys 


He 












85 









Cys He Asn Cys Asp Met Cys Glu 

10 " * 15 

Ser Met Gly Asp His He Tyr Glu 
25 30 
Cys Val Gly His Tyr Glu Thr Pro 
45 

Pro Asn Thr He Val Lys Asp Pro 
60 

Leu Trp Asp Lys Phe Val Leu Met 
75 80 
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<400> 


270 










Met Gin 


Ser 


Val 


Asp 


Val 


Ala 


He 


1 






5 








Ala Val 


Ala 


Cys 


Gly 


Leu 


Gin 


Gly 






20 










Glu Gin 


Arg 


Val 


Gin 


Glu 


Pro 


Leu 




35 










40 


Arg Val 


Ser 


Ala 


He 


Asn 


Ala 


Ala 


50 










55 




Gly Val 


Trp 


Gin 


Asp 


He 


Leu 


Ser 


65 








70 






Met Glu 


Val 


Trp 


Asp 


Lys 


Asp 


Ser 








85 








Gin Ser 


Met 


Gly 


Tyr 


Ser 


His 


Leu 






100 










lie His 


Tyr 


Ala 


Leu 


Trp 


Asn 


Lys 




115 










120 


Leu Leu 


Ala 


Pro 


Ala 


Glu 


Leu 


Gin 


130 










135 




Thr Phe 


Leu 


Thr 


Leu 


Lys 


Asp 


Gly 


145 








150 






He Gly Ala 


Asp 


Gly 


Ala 


Asn 


Ser 








165 








Pro Leu 


Thr 


Phe 


Trp 


Asp 


Tyr 


Gin 






180 










Arg Thr Glu 


Glu 


Pro 


His 


Asp 


Ala 




195 










200 


Glu Gly 


He 


Leu 


Ala 


Phe 


Leu 


Pro 


210 










215 




He Val 


Trp 


Ser 


Leu 


Ser 


Pro 


Glu 


225 








230 






Ser Glu Asp 


Glu 


Phe 


Asn 


Arg 


Ala 








245 








Leu Gly 


Leu 


Cys 


Lys 


Val 


Glu 


Ser 






260 










Gly Arg Tyr 


Ala 


Arg 


Gin 


Phe 


Ala 




275 










280 


Asp Ala Ala 


His 


Thr 


He 


His 


Pro 


290 










295 




Gly Phe 


Met 


Asp 


Ala 


Ala 


Glu 


Leu 


305 








310 






Arg Gin Gly 


Lys 


Asp 


He 


Gly 


Gin 








325 








Arg Ser Arg 


Lys 


His 


Ser 


Ala 


Ala 






340 










Phe Arg Asp 


Leu 


Phe 


Ser 


Gly 


Thr 




355 










360 


Asp He 


Gly 


Leu 


Lys 


Leu 


Ala 


Asp 


370 










375 




Leu He 


Arg 


Gin 


Ala 


Met 


Gly 


Leu 


385 








390 
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Val 


Gly 


Gly 


Gly 


Met 


Val 


Gly 


Leu 




10 










15 




Ser 


Gly 


Leu 


Arg 


Val 


Ala 


Val 


Leu 


25 










30 






Ala 


Ala 


Asn 


Ala 


Pro 


Pro 


Gin 


Leu 










45 








Ser 


Glu 


Lys 


Leu 


Leu 


Thr 


Arg 


Leu 








60 










Arg 


Arg 


Ala 


Ser 


Cys 


Tyr 


His 


Gly 






75 










80 


Phe 


Gly 


His 


He 


Ser 


Phe 


Asp 


Asp 




90 










95 




Gly 


His 


lie 


Val 


Glu 


Asn 


Ser 


Val 


105 










110 






Ala 


His 


Gin 


Ser 


Ser 


Asp 


He 


Thr 










125 








Gin 


Val 


Ala 


Trp 


Gly 


Glu 


Asn 


Glu 








140 










Ser 


Met 


Leu 


Thr 


Ala 


Arg 


Leu 


Val 






155 










160 


Trp 


Leu 


Arg 


Asn 


Lys 


Ala 


Asp 


He 




170 










175 




His 


His 


Ala 


Leu 


Val 


Ala 


Thr 


He 


185 










190 






Val 


Ala 


Arg 


Gin 


Val 


Phe 


His 


Gly 










205 








Leu 


Ser 


Asp 


Pro 


His 


Leu 


Cys 


Ser 








220 










Glu 


Ala 


Gin 


Arg 


Met 


Gin 


Gin 


Ala 






235 










240 


Leu 


Asn 


He 


Ala 


Phe 


Asp 


Asn 


Arg 




250 










255 




Ala 


Arg 


Gin 


Val 


Phe 


Pro 


Leu 


Thr 


265 










270 






Ser 


His 


Arg 


Leu 


Ala 


Leu 


Val 


Gly 










285 








Leu 


Ala 


Gly 


Gin 


Gly 


Val 


Asn 


Leu 








300 










He 


Ala 


Glu 


Leu 


Lys 


Arg 


Leu 


His 






315 










320 


Tyr 


He 


Tyr 


Leu 


Arg 


Arg 


Tyr 


Glu 




330 










335 




Leu 


Met 


Leu 


Ala 


Gly 


Met 


Gin 


Gly 


345 










350 






Asn 


Pro 


Ala 


Lys 


Lys 


Leu 


Leu 


Arg 










365 








Thr 


Leu 


Pro 


Gly 


Val 


Lys 


Pro 


Gin 








380 










Asn 


Asp 


Leu 


Pro 


Glu 


Trp 


Leu 


Arg 






395 










400 
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<211> 392 
<212> PRT 
<213> E. Coli 

<400> 271 

Met Ser Val He He Val Gly Gly Gly Met Ala Gly Ala Thr Leu Ala 

1 5 10 15 

Leu Ala He Ser Arg Leu Ser His Gly Ala Leu Pro Val His Leu He 



Ala He Ala Leu Ala Ala Gly Thr Cys Gin Gin Leu Ala Arg He Gly 

50 55 60 

Val Trp Gin Ser Leu Ala Asp Cys Ala Thr Ala He Thr Thr Val His 
65 70 75 80 

Val Ser Asp Arg Gly His Ala Gly Phe Val Thr Leu Ala Ala Glu Asp 

85 90 95 

Tyr Gin Leu Ala Ala Leu Gly Gin Val Val Glu Leu His Asn Val Gly 

100 105 110 

Gin Arg Leu Phe Ala Leu Leu Arg Lys Ala Pro Gly Val Thr Leu His 

115 120 125 

Cys Pro Asp Arg Val Ala Asn Val Ala Arg Thr Gin Ser His Val Glu 

130 135 140 

Val Thr Leu Glu Ser Gly Glu Thr Leu Thr Gly Arg Val Leu Val Ala 
145 150 155 - 160 

Ala Asp Gly Thr His Ser Ala Leu Ala Thr Ala Cys Gly Val Asp Trp 

165 170 175 

Gin Gin Glu Pro Tyr Glu Gin Leu Ala Val He Ala Asn Val Ala Thr 

180 185 190 

Ser Val Ala His Glu Gly Arg Ala Phe Glu Arg Phe Thr Gin His Gly 

195 200 205 

Pro Leu Ala Met Leu Pro Met Ser Asp Gly Arg Cys Ser Leu Val Trp 

210 215 220 

Cys His Pro Leu Glu Arg Arg Glu Glu Val Leu Ser Trp Ser Asp Glu 
225 230 235 * 240 

Lys Phe Cys Arg Glu Leu Gin Ser Ala Phe Gly Trp Arg Leu Gly Lys 

245 250 255 

He Thr His Ala Gly Lys Arg Ser Ala Tyr Pro Leu Ala Leu Thr His 

260 265 270 

Ala Ala Arg Ser lie Thr His Arg Thr Val Leu Val Gly Asn Ala Ala 

275 280 285 

Gin Thr Leu His Pro He Ala Gly Gin Gly Phe Asn Leu Gly Met Arg 

290 295 300 

Asp Val Met Ser Leu Ala Glu Thr Leu Thr Gin Ala Gin Glu Arg Gly 
305 310 315 320 

Glu Asp Met Gly Asp Tyr- Gly Val Leu Cys Arg Tyr Gin Gin Arg Arg 

325 330 335 

Gin Ser Asp Arg Glu Ala Thr He Gly Val Thr Asp Ser Leu Val His 

340 345 " 350 

Leu Phe Ala Asn Arg Trp Ala Pro Leu Val Val Gly Arg Asn lie Gly 

355 360 " 365 

Leu Met Thr Met Glu Leu Phe Thr Pro Ala Arg Asp Val Leu Ala Gin 

370 375 380 

Arg Thr Leu Gly Trp Val Ala Arg 
385 390 



<210> 272 
<211> 441 
<212> PRT 
<213> E. Coli 



20 

Glu Ala Thr Ala Pro Glu Ser His 
35 40 



25 30 
Ala His Pro Gly Phe Asp Gly Arg 
45 
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<400> 272 



Met 


Ser 


Glu 


He 


Ser 


Arg 


Gin 


Glu 


1 








5 








Val 


Glu 


Gin 


Met 


Gin 


Pro 


Gly 


Ser 








20 










Glu 


Val 


Thr 


Arg 


Ser 


Ala 


Asp 


Ser 






35 










40 


Asp 


Phe 


Trp 


Tyr 


Phe 


Thr 


Gly 


Phe 




50 










55 




Leu 


He 


Lys 


Ser 


Asp 


Asp 


Thr 


His 


65 










70 






Val 


Arg 


Asp 


Leu 


Thr 


Ala 


Glu 


lie 










85 








Asp Ala 


Ala 


Pro 


Glu 


Lys 


Leu 


Gly 








100 










Glu 


He 


Asn 


Gin 


Gin 


Leu 


Tyr 


Gin 






115 










120 


Tyr 


His 


Ala 


Gin 


Gly 


Glu 


Tyr 


Ala 




130 










135 




Ala 


Leu 


Glu 


Lys 


Leu 


Arg 


Lys 


Gly 


145 










150 






Ala 


Thr 


Met 


He 


Asp 


Trp 


Arg 


Pro 










165 








Lys 


Ser 


Pro 


Glu 


Glu 


lie 


Ala 


Val 








180 










Ala 


Met 


Ala 


His 


Thr 


Arg 


Ala 


Met 






195 










200 


Glu 


Tyr 


His 


Leu 


Glu 


Gly 


Glu 


lie 




210 










215 




Ala Arg 


Tyr 


Pro 


Ser 


Tyr 


Asn 


Thr 


225 










230 






Cys 


lie 


Leu 


His 


Tyr 


Thr 


Glu 


Asn 










245 








Leu 


Val 


Leu 


He 


Asp 


Ala 


Gly 


Cys 








260 










He 


Thr 


Arg 


Thr 


Phe 


Pro 


Val 


Asn 






275 










280 


Glu 


He 


Tyr 


Asp 


He 


Val 


Leu 


Glu 




290 










295 




Tyr 


Arg 


Pro 


Gly 


Thr 


Ser 


He 


Leu 


305 










310 






He 


Met 


Val 


Ser 


Gly 


Leu 


Val 


Lys 










325 








Asp Glu 


Leu 


He 


Ala 


Gin 


Asn 


Ala 








340 










Leu 


Ser 


His 


Trp 


Leu 


Gly 


Leu 


Asp 






355 










360 


Gin Asp 


Arg 


Ser 


Arg 


He 


Leu 


Glu 




370 










375 




Pro Gly 


Leu 


Tyr 


He 


Ala 


Pro 


Asp 


385 










390 






Gly He 


Gly 


He 


Arg 


lie 


Glu 


Asp 










405 








Asn 


Glu 


Asn 


Leu 


Thr 


Ala 


Ser 


Val 








420 










Ala 


Leu 


Met 


Val 


Ala 


Ala 


Arg 


Lys 






435 










440 
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Phe 


Gin 


Arg 


Arg 


Arg 


Gin 


Ala 


Leu 




10 










15 




Ala 


Ala 


Leu 


lie 


Phe 


Ala 


Ala 


Pro 


25 










30 






Glu 


Tyr 


Pro 


Tyr 


Arg 
45 


Gin 


Asn 


Ser 


Asn 


Glu 


Pro 


Glu 


Ala 


Val 


Leu 


Val 








60 










Asn 


His 


Ser 


Val 


Leu 


Phe 


Asn 


Arg 






75 










80 


Trp 


Phe 


Gly 


Arg 


Arg 


Leu 


Gly 


Gin 




90 










95 




Val 


Asp 


Arg 


Ala 


Leu 


Ala 


Phe 


Ser 


105 










110 






Leu 


Leu 


Asn 


Gly 


Leu 


Asp 


Val 


Val 










125 








Tyr 


Ala 


Asp 


Val 


lie 


Val 


Asn 


Ser 








140 










Ser 


Arg 


Gin 


Asn 


Leu 


Thr 


Ala 


Pro 






155 










160 


Val 


Val 


His 


Glu 


Met 


Arg 


Leu 


Phe 




170 










175 




Leu 


Arg 


Arg 


Ala 


Gly 


Glu 


lie 


Thr 


185 










190 






Glu 


Lys 


Cys 


Arg 


Pro 


Gly 


Met 


Phe 










205 








His 


His 


Glu 


Phe 


Asn 


Arg 


His 


Gly 








220 










lie 


Val 


Gly 


Ser 


Gly 


Glu 


Asn 


Gly 






235 










240 


Glu 


Cys 


Glu 


Met 


Arg 


Asp 


Gly 


Asp 




250 










255 




Glu 


Tyr 


Lys 


Gly 


Tyr 


Ala 


Gly 


Asp 


265 










270 






Gly 


Lys 


Phe 


Thr 


Gin 


Ala 


Gin 


Arg 










285 








Ser 


Leu 


Glu 


Thr 


Ser 


Leu 


Arg 


Leu 








300 








Glu 


Val 


Thr 


Gly 


Glu 


Val 


Val 


Arg 






315 










320 


Leu 


Gly 


lie 


Leu 


Lys 


Gly 


Asp 


Val 




330 










335 




His 


Arg 


Pro 


Phe 


Phe 


Met 


His 


Gly 


345 










350 






Val 


His 


Asp 


Val 


Gly 


Val 


Tyr 


Gly 










365 








Pro 


Gly 


Met 


Val 


Leu 


Thr 


Val 


Glu 








380 










Ala 


Glu 


Val 


Pro 


Glu 


Gin 


Tyr 


Arg 






395 










400 


Asp 


He 


Val 


lie 


Thr 


Glu 


Thr 


Gly 




410 










415 




Val 


Lys 


Lys 


Pro 


Glu 


Glu 


lie 


Glu 


425 










430 







Gin 



-109- 



WO 00/44906 



<211> 194 
<212> PRT 
<213> E. Coli 



<400> 273 



Met 


Leu 


Met 


Ser 


He 


Gin 


Asn 


Glu 


Met 


1 








5 










Gin Tyr 


Leu 


Asn 


Gin 


Gin 


Gly 


Thr 


Gly 








20 










25 


Gly Leu 


He 


Ser 


Gly 


Met 


He 


Cys 


Gly 






35 










40 




Leu 


Pro 


Leu 


Leu 


His 


Asp 


Leu 


Thr 


Asn 




50 










55 






Glu 


Leu 


Ala 


Gin 


Ala 


Leu 


Arg 


Lys 


Met 


65 










70 








Leu 


Gin 


Asp 


Asp 


Gly 


Phe 


Leu 


Phe 


Gin 










85 










Asp Val 


Ser 


Val 


Phe 


Asp 


Arg 


Ala 


Asp 








100 










105 


His 


Phe 


Leu 


Leu 


Gly 


Leu 


Gly 


Val 


Thr 






115 










120 




Thr 


Gly 


Glu 


Thr 


Gly 


Glu 


Ala 


He 


Asp 




130 










135 






Leu Gly 


Tyr 


Asp 


Glu 


Asp 


Glu 


Asp 


Gin 


145 










150 








Glu 


Glu 


He 


He 


Glu 


Tyr 


Val 


Arg 


Val 










165 










Thr 


Phe 


Thr 


His 


Pro 


Gin 


Pro 


Thr 


Ala 








180 










185 


Leu 


His 
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<211> 
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274 












Met 


Leu 


Lys 


Leu 


Phe 


Ala 


Lys 


Tyr 


Thr 


1 








5 










Leu 


He 


His 


Trp 


Val 


Val 


Phe 


Gly 


Val 








20 










25 


Asn 


Gin 


Ala 


Leu 


Ala 


Asn 


Phe 


Ala 


Gly 






35 










40 




Ser 


Phe 


Phe 


Ala 


Asn 


Ala 


Lys 


Phe 


Thr 




50 










55 






Met 


Arg 


Tyr 


Met 


Leu 


Tyr 


Val 


Gly 


Phe 


65 










70 








Val 


Gly 


Trp 


Ala 


Ala 


Asp 


Arg 


Cys 


Ala 










85 










Val 


Thr 


Phe 


Ser 


Ala 


He 


Ser 


Leu 


Val 








100 










105 


Phe 


He 


Val 


Phe 


Arg 


Asp 


Ala 


Lys 
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120 
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Pro Gly 


Tyr 


Asn 


Glu 


Met 


Asn 


10 










15 




Leu 


Thr 


Pro 


Ala 


Glu 


Met 


His 










30 






Gly Asn 


Asp 


Asp 


Ser 


Ser 


Trp 








45 








Glu Gly 


Met 


Ala 


Phe 


Gly 


His 






60 










His 


Ser 


Ala 


Thr 


Ser 


Asp 


Ala 




75 










80 


Leu 


Tyr 


Leu 


Pro 


Asp 


Gly 


Asp 


90 










95 




Ala 


Leu 


Ala 


Gly 


Trp 


Val 


Asn 










110 






Gin 


Pro 


Lys 


Leu 


Asp 


Lys 


Val 








125 








Asp 


Leu 


Arg 


Asn 


He 


Ala 


Gin 






140 










Glu 


Glu 


Leu 


Glu 


Met 


Ser 


Leu 




155 










160 


Ala 


Ala 


Leu 


Leu 


Cys 


His 


Asp 


170 










175 




Pro 


Glu 


Val 


Gin 


Lys 


Pro 


Thr 










190 






Ser 


lie 


Gly 


Val 


Leu 


Asn 


Thr 


10 










15 




Cys 


lie 


Tyr 


Val 


Ala 


His 


Thr 










30 






Phe 


Val 


Val 


Ala 


Val 


Ser 


Phe 








45 








Phe 


Lys 


Ala 


Ser 


Thr 


Thr 


Thr 






60 










Met 


Gly 


Thr 


Leu 


Ser 


Ala 


Thr 




75 










80 


Leu 


Pro 


Pro 


Met 


lie 


Thr 


Leu 


90 










95 




Cys 


Gly 


Phe 


Val 


Tyr 


Ser 


Lys 



110 
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<400> 275 

Met Lys lie Ser Leu Val Val Pro Val Phe Asn Glu Glu Glu Ala He 

15 10 15 

Pro He Phe Tyr Lys Thr Val Arg Glu Phe Glu Glu Leu Lys Ser Tyr 

20 25 30 

Glu Val Glu He Val Phe He Asn Asp Gly Ser Lys Asp Ala Thr Glu 

35 40 45 

Ser He He Asn Ala Leu Ala Val Ser Asp Pro Leu Val Val Pro Leu 

50 55 60 

Ser Phe Thr Arg Asn Phe Gly Lys Glu Pro Ala Leu Phe Ala Gly Leu 
65 70 75 80 

Asp His Ala Thr Gly Asp Ala He He Pro He Asp Val Asp Leu Gin 

85 90 95 

Asp Pro He Glu Val He Pro His Leu He Glu Lys Trp Gin Ala Gly 

100 105 ' HO 

Ala Asp Met Val Leu Ala Lys Arg Ser Asp Arg Ser Thr Asp Gly Arg 

115 120 125 

Leu Lys Arg Lys Thr Ala Glu Trp Phe Tyr Lys Leu His Asn Lys He 

130 135 140 

Ser Asn Pro Lys He Glu Glu Asn Val Gly Asp Phe Arg Leu Met Ser 
145 150 155 " 160 

Arg Asp Val Val Glu Asn He Lys Leu Met Pro Glu Arg Asn Leu Phe 

165 170 175 

Met Lys Gly lie Leu Ser Trp Val Gly Gly Lys Thr Asp He Val Glu 

180 185 190 

Tyr Val Arg Ala Glu Arg He Ala Gly Asp Thr Lys Phe Asn Gly Trp 

195 200 205 

Lys Leu Trp Asn Leu Ala Leu Glu Gly He Thr Ser Phe Ser Thr Phe 

210 215 220 

Pro Leu Arg He Trp Thr Tyr He Gly Leu Val Val Ala Ser Val Ala 
225 230 235 240 

Phe He Tyr Gly Ala Trp Met lie Leu Asp Thr He He Phe Gly Asn 

245 250 255 

Ala Val Arg Gly Tyr Pro Ser Leu Leu Val Ser lie Leu Phe Leu Gly 

260 265 270 

Gly lie Gin Met lie Gly He Gly Val Leu Gly Glu Tyr lie Gly Arg 

275 280 285 

Thr Tyr lie Glu Thr Lys Lys Arg Pro Lys Tyr lie lie Lys Arg Val 

290 295 300 

Lys Lys 
305 



<210> 276 
<211> 443 
<212> PRT 
<213> E. Coli 



<400> 


276 










Met Asn 


Lys 


Ala 


lie 


Lys 


Val 


Ser 


1 






5 








lie Cys 


Ala 


Leu 


Ser 


Lys 


Asn 


lie 






20 










Gly Arg 


Ala 


lie 


Lys 


Pro 


Leu 


lie 




35 










40 


Asp Leu 


Pro 


Leu 


Leu 


Tyr 


Lys 


Leu 


50 










55 




Ser Tyr 


Glu 


Tyr 


lie 


Ser 


Ser 


Tyr 


65 








70 






Leu Phe 


lie 


Ser 


lie 


Phe 


Thr 


Glu 



85 



Leu Tyr lie Ser Phe Val Leu lie 

10 15 
Met Met Leu Asn Thr Ser Asp Phe 
25 30 
Glu Asp lie Pro Ala Phe Thr Tyr 
4 5 

Lys Gly His lie Asp Ser lie Asp 

. 60 

Ser Tyr lie Leu Tyr Thr Tyr Val 

75 80 
Tyr Leu Asp Ala Arg Val Leu Ser 
90 95 
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Leu Phe 

Ser Tyr 

Leu Ala 
130 
Phe Tyr 
145 

Ser Leu 

Leu Leu 

lie Val 

Lys Ser 
210 

Ser Tyr 
225 

Phe Gly 

Tyr Val 

Asp He 

Glu Ser 
290 
Ser Lys 
305 

Ser Gin 

He Tyr 

Asn He 

Ala Ser 
370 
Val Ser 
385 

Gly Tyr 
Asp Leu 
Gin Arg 



Leu Lys 
100 
He Lys 
115 

Phe Leu 

Gin Glu 

Thr Cys 

He He 
180 
Tyr Ser 
195 

Val He 

Ser Lys 

Ser Tyr 

Asp Asp 
260 
Ser Phe 
275 

His Lys 

Pro Ser 

Tyr Lys 

Gly Glu 
340 
Phe Lys 
355 

He Phe 

Leu Phe 

Arg Asp 

Cys Leu 
420 
Asn Gin 
435 



Val He 

Thr Glu 

Met Cys 

Gin He 
150 
Lys Asn 
165 

Ser Thr 

Tyr Tyr 

Cys Val 

Gly Val 
230 
Leu Tyr 
245 

Lys Cys 

Gly Ala 

Asp Glu 

Thr He 
310 
Glu Asn 
325 

Ser Asn 

Asn He 

He Arg 

Gly He 
390 
Leu Ser 
405 

Tyr lie 
Asp Asn 



Tyr He Tyr 
105 

Arg Tyr Val 

120 
Ser Ser Ser 
135 

Val He He 

Asn Lys Ser 

Ala Lys Asn 
185 

He Phe Phe 

200 
Val Cys Leu 
215 

Val Glu Leu 

Met Lys Asn 

Val Gly Leu 
265 

Thr Pro Thr 

280 
Thr Phe Ser 
295 

Phe Lys Leu 

Tyr Phe His 

He Leu Thr 
345 

Arg Phe lie 

360 
Asn Asn Lys 
375 

Ser Gin Phe 

Lys His Leu 

Thr Val Val 
425 

Ser Asp Val 
440 



Ser Leu Tyr Ala He Phe Thr 
110 

Thr Leu Phe Thr Phe Phe lie 
125 

Thr Leu Ser Met Phe Ala Ser 
140 

Phe Leu Pro Phe Leu Val Tyr 
155 160 
Met Leu Leu Leu Phe Phe Ser 
170 175 
Gin Phe He Leu Thr Pro Leu 
190 

Asp Arg His Lys Leu He He 
205 

Leu Ala Ser He Phe Ala lie 
220 

Asn Lys Tyr His Ala Thr Tyr 
235 240 
Asn Gly Tyr Lys Met Pro Ser 
250 255 
Asp Ala Trp Gly Asn Lys Phe 
270 

Glu Val Gly Thr Glu Cys Phe 
285 

Asn Ala Leu Phe Leu Leu Val 
300 

Pro Phe Asp Asp Gly Val Met 
315 " 4 320 
Val Tyr Lys Lys Leu His Val 
330 335 
Thr lie Thr Asn lie Lys Asp 
350 

Ser Leu Leu Leu Phe Phe He 
365 

lie Lys Ala Ser Leu Phe Val 
380 

Tyr Val Ser Phe Phe Gly Glu 
395 400 
Phe Gly Met Tyr Phe Ser Phe 
410 415 
Phe Leu lie Tyr Lys lie lie 
430 

Lys His 



<210> 277 

<211> 82 

<212> PRT 

<213> E . Coli 

<400> 277 
Met Gly He Leu Ser Trp lie lie 

1 5 
Ala Lys Trp lie Met Pro Gly Lys 
20 

lie Leu Leu Gly lie Val Gly Ala 

35 40 
Leu Phe Gly Phe Gly Lys Val Asp 
50 55 



Phe Gly Leu lie Ala Gly lie Leu 

10 15 
Asp Gly Gly Gly Phe Phe Met Thr 
25 30 
Val Val Gly Gly Trp He Ser Thr 
45 

Gly Phe Asn Phe Gly Ser Phe Val 
60 
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Val Ala Val lie Gly Ala He Val Val Leu Phe He Tyr Arg Lys He 
65 70 75 ' 80 

Lys Ser 



<210> 278 
<211> 60 
<212> PRT 
<213> E . Coli 



<400> 278 

Met Gly Lys Ala Thr Tyr Thr Val Thr Val Thr Asn Asn Ser Asn Gly 

15 10 15 

Val Ser Val Asp Tyr Glu Thr Glu Thr Pro Met Thr Leu Leu Val Pro 

20 25 30 

Glu Val Ala Ala Glu Val He Lys Asp Leu Val Asn Thr Val Arg Ser 

35 40 45 

Tyr Asp Thr Glu Asn Glu His Asp Val Cys Gly Trp 
50 55 60 



<210> 279 
<211> 119 
<212> PRT 
<213> E. Coli 





<400> 


279 




Met 


Leu 


Gin 


He 


Pro Gin Asn Tyr 


1 








5 


Trp Asn 


Lys 


Gin 


Thr Ala Pro Ala 








20 




Lys 


Gly 


Thr Arg 


Pro Gly Val Tyr 






35 




40 


Ala 


Val 


Lys 


Tyr 


Leu Gly Tyr Ala 




50 






55 


Gin 


Val 


He 


Leu 


He Glu Ala Gly 


65 








70 


Lys 


Trp 


His 


Asn 


He Glu Ala Met 










85 


Asp 


Phe 


Phe 


Val 


Ala Pro Glu Val 








100 




Lys 


Val 


He 


His 


Asn Gly Lys 






115 







He His Thr Arg Ser Thr Pro Phe 

10 15 
Gly He Phe Glu Arg His Leu Asp 
25 30 
Pro Arg Leu Ser Val Met His Gly 
45 

Asp Glu His Ser Ala Glu Pro Asp 
60 

Gin Phe Ala Val Phe Pro Pro Glu 

75 80 
Thr Asp Asp Thr Tyr Phe Asn He 

90 95 
Leu Met Glu Gly Ala Gin Gin Arg 
105 HO 



<210> 280 
<211> 246 
<212> PRT 
<213> E. Coli 



<400> 280 

Met Lys Phe Lys Val He Ala Leu Ala Ala Leu Met Gly He Ser Gly 

15 10 15 

Met Ala Ala Gin Ala Asn Glu Leu Pro Asp Gly Pro His He Val Thr 

20 25 30 

Ser Gly Thr Ala Ser Val Asp Ala Val Pro Asp He Ala Thr Leu Ala 

35 40 45 

He Glu Val Asn Val Ala Ala Lys Asp Ala Ala Thr Ala Lys Lys Gin 

50 55 60 

Ala Asp Glu Arg Val Ala Gin Tyr He Ser Phe Leu Glu Leu Asn Gin 
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65 








70 




He Ala 


Lys 


Lys 


Asp 


He 


Ser Ser 








85 






Tyr Asp 


Tyr 


Gin 


Asp 


Gly 


Lys Ser 






100 








Arg Thr 


Val 


Glu 


Val 


Thr 


Leu Arg 




115 








120 


Leu Asp 


Gly 


Ala 


Leu 


Lys 


Ala Gly 


130 










135 


Leu Gly 


Val 


Ala 


Gin 


Pro 


Asp Ala 


145 








150 




Ala He 


Asp 


Asn 


Ala 


He 


His Gin 








165 






His Arg 


Lys 


Leu 


Gly 


Pro 


Val Tyr 






180 








Tyr Gin 


Pro 


Ser 


Pro 


Met 


Val Arg 




195 








200 


Val Ser 


Ala 


Gin 


Glu 


Thr 


Tyr Glu 


210 










215 


Gin Val 


Asp 


Val 


Val 


Phe 


Gin Leu 


225 








230 




Lys Thr 


Pro 


Ala 


Ala 


Gin 










245 







75 80 



Ala 


Asn 


Leu 


Arg 


Thr 


Gin 


Pro Asp 




90 










95 




He 


Leu 


Lys 


Gly 


Tyr 


Arg 


Ala 


Val 


105 










110 






Gin 


Leu Asp 


Lys 


Leu 


Asn 


Ser 


Leu 










125 








Leu 


Asn 


Glu 


He 
140 


Arg 


Ser 


val 


Ser 


Tyr 


Lys 


Asp 
155 


Lys 


Ala 


Arg 


Lys 


Ala 
160 


Ala 


Gin 
170 


Glu 


Leu 


Ala 


Asn 


Gly 
175 


Phe 


Ser 


Val 


Arg 


Tyr 


His 


Val 


Ser 


Asn 


185 










190 






Met 


Met 


Lys 


Ala 


Asp Ala 


Ala 


Pro 










205 








Gin 


Ala 


Ala 


He 


Gin 


Phe 


Asp Asp 








220 










Glu 


Pro 


Val 
235 


Asp 


Gin 


Gin 


Pro 


Ala 
240 



<210> 281 
<211> 464 
<212> PRT 
<213> E. Coli 



<400> 281 



Met Leu 


Leu 


Leu 


Asp 


Ala 


Cys 


Ser 


Gin 


Met 


Cys 


Pro 


Ser 


Phe Arg Arg 


1 






5 










10 










15 


Phe Gin 


Thr 


Val 


Phe 


His 


Asn 


Ser 


Ser 


He 


Phe 


Leu 


Pro 


Tyr 


Trp Leu 






20 










25 










30 




Ala Thr 


Leu 


Val 


Ser 


Phe 


Arg 


Glu 


Thr 


Phe 


Gin 


Glu 


Glu 


Lys 


Leu Leu 




35 










40 










45 






Thr Met 


Lys 


Gly 


Ser 


Tyr 


Lys 


Ser Arg 


Trp Val 


He 


Val 


He 


Val Val 


50 










55 










60 








Val He 


Ala 


Ala 


He 


Ala 


Ala 


Phe 


Trp 


Phe 


Trp 


Gin 


Gly Arg Asn Asp 


65 








70 










75 








80 


Ser Arg 


Ser 


Ala 


Ala 


Pro Gly Ala Thr 


Lys 


Gin 


Ala 


Gin 


Gin 


Ser Pro 








85 










90 










95 


Ala Gly 


Gly 


Arg 


Arg 


Gly Met 


Arg 


Ser 


Gly Pro Leu 


Ala 


Pro 


Val Gin 






100 










105 










110 




Ala Ala 


Thr 


Ala 


Val 


Glu 


Gin 


Ala 


Val 


Pro 


Arg 


Tyr 


Leu 


Thr 


Gly Leu 




115 










120 










125 






Gly Thr 


He 


Thr 


Ala 


Ala 


Asn 


Thr 


Val 


Thr 


Val 


Arg 


Ser 


Arg 


Val Asp 


130 










135 










140 








Gly Gin 


Leu 


He 


Ala 


Leu 


His 


Phe 


Gin 


Glu 


Gly 


Gin 


Gin 


Val 


Lys Ala 


145 








150 










155 








160 


Gly Asp 


Leu 


Leu 


Ala 


Glu 


He 


Asp 


Pro 


Ser 


Gin 


Phe 


Lys 


Val 


Ala Leu 








165 










170 










175 


Ala Gin 


Ala 


Gin 


Gly 


Gin 


Leu 


Ala 


Lys 


Asp 


Lys 


Ala 


Thr 


Leu 


Ala Asn 






180 










185 










190 




Ala Arg 


Arg 


Asp 


Leu 


Ala 


Arg 


Tyr 


Gin 


Gin 


Leu 


Ala 


Lys 


Thr 


Asn Leu 




195 










200 










205 






Val Ser 


Arg 


Gin 


Glu 


Leu 


Asp Ala 


Gin 


Gin 


Ala 


Leu 


Val 


Ser 


Glu Thr 


210 










215 










220 








Glu Gly 


Thr 


He 


Lys 


Ala 


Asp 


Glu 


Ala 


Ser 


Val 


Ala 


Ser 


Ala 


Gin Leu 


225 








230 










235 








240 
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Gin 


Leu 


Asp 


Trp 


Ser 
245 


Arg 


He 


Thr 


Ala 


Pro 
250 


Val 


Asp 


Gly 


Ara 


Val 
255 


Glv 


Leu 


Lys 


Gin 


Val 


Asp 


Val Gly 


Asn 


Gin 


He 


Ser 


Ser 


Gly 


Asp 


Thr 


Thr 








260 










265 










270 






Gly 


He 


Val 
275 


Val 


He 


Thr 


Gin 


Thr 
280 


His 


Pro 


He 


Asp 


Leu 
285 


Val 


Phe 


Thr 


Leu 


Pro 


Glu 


Ser 


Asp 


He 


Ala 


Thr 


Val 


Val 


Gin 


Ala 


Gin 


Lys 


Ala 


Glv 




290 










295 










300 








Lys 


Pro 


Leu 


Val 


Val 


Glu 


Ala 


Trp 


Asp 


Arg 


Thr 


Asn 


Ser 


Lvs 


Lvs 


Leu 


305 










310 










315 










320 


Ser 


Glu 


Gly 


Thr 


Leu 
325 


Leu 


Ser 


Leu 


Asp 


Asn 
330 


Gin 


He 


Asp 

IT 


Ala 


Thr 
335 


Thr 


Gly Thr 


He 


Lys 


Val 


Lys 


Ala 


Arg 


Phe 


Asn 


Asn 


Gin 


Asp 


Asp 


Ala 


Leu 








340 










345 










350 






Phe 


Pro 


Asn 
355 


Gin 


Phe 


Val 


Asn 


Ala 
360 


Ara 


Met 


Leu 


Val 


Asp 
365 


Thr 


Glu 


Gin 


Asn 


Ala 
370 


Val 


Val 


lie 


Pro 


Thr 
375 


Ala 


Ala 


Leu 


Gin 


Met 
380 


Glv 


Asn 


Glu 


Glv 


His 


Phe 


Val 


Trp 


Val 


Leu 


Asn 


Ser 


Glu 


Asn 


Lvs 


Val 


Ser 


Lvs 


His 


Leu 


385 










390 










395 










400 


Val 


Thr 


Pro 


Glv 


He 
405 


Gin 


Asp 


Ser 


Gin 


Lvs 
410 


Val 


Val 


He 


Arg 


Ala 
415 


Glv 


He 


Ser 


Ala 


Gly 
420 


Asp 


Arg 


Val 


Val 


Thr 
425 


Asp 


Gly 


He 


Asp 


Arg 
430 


Leu 


Thr 


Glu 


Gly 


Ala 
435 


Lys 


Val 


Glu 


Val 


Val 
440 


Glu 


Ala 


Gin 


Ser 


Ala 
445 


Thr 


Thr 


Pro 


Glu 


Glu 
450 


Lys 


Ala 


Thr 


Ser 


Arg 
455 


Glu 


Tyr 


Ala 


Lys 


Lys 
460 


Gly 


Ala 


Arg 


Ser 



<210> 282 
<211> 1040 
<212> PRT 
<213> E. Coli 

<400> 282 



Met 


Gin 


Val 


Leu 


Pro 


Pro 


Ser 


Ser 


Thr 


Gly 


Gly 


Pro 


Ser 


Arg 


Leu 


Phe 


1 








5 










10 










15 




He 


Met 


Arg 


Pro 


Val 


Ala 


Thr 


Thr 


Leu 


Leu 


Met 


Val 


Ala 


He 


Leu 


Leu 








20 










25 










30 






Ala 


Gly 


He 


He 


Gly 


Tyr Arg 


Ala 


Leu 


Pro 


Val 


Ser 


Ala 


Leu 


Pro 


Glu 






35 










40 










45 








Val 


Asp 


Tyr 


Pro 


Thr 


He 


Gin 


Val 


Val 


Thr 


Leu 


Tyr 


Pro 


Gly 


Ala 


Ser 




50 










55 










60 










Pro Asp 


Val 


Met 


Thr 


Ser 


Ala 


Val 


Thr 


Ala 


Pro 


Leu 


Glu 


Arg 


Gin 


Phe 


65 










70 










75 










80 


Gly Gin 


Met 


Ser 


Gly 


Leu 


Lys 


Gin 


Met 


Ser 


Ser 


Gin 


Ser 


Ser 


Gly 


Gly 










85 










90 










95 




Ala 


Ser 


Val 


He 


Thr 


Leu 


Gin 


Phe 


Gin 


Leu 


Thr 


Leu 


Pro 


Leu 


Asp 


Val 








100 










105 










110 






Ala 


Glu 


Gin 


Glu 


Val 


Gin 


Ala 


Ala 


He 


Asn 


Ala 


Ala 


Thr 


Asn 


Leu 


Leu 






115 










120 










125 








Pro 


Ser 


Asp 


Leu 


Pro 


Asn 


Pro 


Pro 


Val 


Tyr 


Ser 


Lys 


Val 


Asn 


Pro 


Ala 




130 










135 










140 










Asp 


Pro 


Pro 


He 


Met 


Thr 


Leu 


Ala 


Val 


Thr 


Ser 


Thr 


Ala 


Met 


Pro 


Met 


145 










150 










155 










160 


Thr 


Gin 


Val 


Glu 


Asp 


Met 


Val 


Glu 


Thr 


Arg 


Val 


Ala 


Gin 


Lys 


He 


Ser 










165 










170 










175 




Gin 


He 


Ser 


Gly 


Val 


Gly 


Leu 


Val 


Thr 


Leu 


Ser 


Gly 


Gly 


Gin 


Arg 


Pro 








180 










185 










190 






Ala 


Val 


Arg 


Val 


Lys 


Leu 


Asn 


Ala 


Gin 


Ala 


He 


Ala 


Ala 


Leu 


Gly 


Leu 
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195 200 205 



Thr 


Ser 


Glu 


Thr 


Val 


Arg 


Thr 


Ala 


He 


Thr Gly Ala 


Asn 


Val 


Asn 


Ser 




210 










215 








220 










Ala 


Lys 


Gly 


Ser 


Leu Asp Gly 


Pro 


Ser 


Arg Ala 


Val 


Thr 


Leu 


Ser 


Ala 


225 










230 








235 










240 


Asn 


Asp 


Gin 


Met 


Gin 


Ser 


Ala 


Glu 


Glu 


Tyr Arg 


Gin 


Leu 


lie 


He 


Ala 










245 










250 








255 




Tyr Gin Asn 


Gly 


Ala 


Pro 


He 


Arg 


Leu 


Gly Asp Val 


Ala 


Thr 


Val 


Glu 








260 










265 








270 






Gin Gly Ala Glu 


Asn 


Ser 


Trp 


Leu 


Gly 


Ala Trp Ala 


Asn 


Lys 


Glu 


Gin 






275 










280 








285 








Ala 


He 


Val 


Met 


Asn 


Val 


Gin 


Arg Gin 


Pro Gly Ala Asn 


He 


He 


Ser 




290 










295 








300 










Thr 


Ala 


Asp 


Ser 


He 


Arg Gin 


Met 


Leu 


Pro Gin 


Leu 


Thr 


Glu 


Ser 


Leu 


305 










310 








315 










320 


Pro 


Lys 


Ser 


Val 


Lys 


Val 


Thr 


Val 


Leu 


Ser Asp Arg Thr Thr Asn 


He 










325 










330 








335 




Arg Ala 


Ser 


Val 


Asp Asp 


Thr 


Gin 


Phe 


Glu Leu 


Met 


Met 


Ala 


He 


Ala 








340 










345 








350 






Leu 


Val 


Val 


Met 


He 


He 


Tyr 


Leu 


Phe 


Leu Arg 


Asn 


He 


Pro 


Ala 


Thr 






355 










360 








365 








lie 


He 


Pro Gly 


Val 


Ala 


Val 


Pro 


Leu 


Ser Leu 


He 


Gly Thr 


Phe 


Ala 




370 










375 








380 










Val 


Met 


Val 


Phe 


Leu Asp 


Phe 


Ser 


He 


Asn Asn 


Leu 


Thr 


Leu 


Met 


Ala 


385 










390 








395 










400 


Leu 


Thr 


He 


Ala 


Thr 


Gly 


Phe 


Val 


Val 


Asp Asp Ala 


He 


Val 


Val 


He 










405 










410 








415 




Glu 


Asn 


He 


Ser 


Arg 


Tyr 


He 


Glu 


Lys 


Gly Glu 


Lys 


Pro 


Leu 


Ala 


Ala 








420 










425 








430 






Ala 


Leu 


Lys 


Gly 


Ala 


Gly Glu 


He 


Gly 


Phe Thr 


lie 


He 


Ser 


Leu 


Thr 






435 










440 








445 








Phe 


Ser 


Leu 


He 


Ala 


Val 


Leu 


He 


Pro 


Leu Leu 


Phe 


Met 


Gly Asp 


lie 




450 










455 








4 60 










Val 


Gly Arg 


Leu 


Phe Arg 


Glu 


Phe 


Ala 


He Thr 


Leu 


Ala 


Val 


Ala 


He 


4 65 










470 








475 










480 


Leu 


He 


Ser 


Ala 


Val 


Val 


Ser 


Leu 


Thr 


Leu Thr 


Pro 


Met 


Met 


Cys 


Ala 










485 










4 90 








495 




Arg Met 


Leu 


Ser 


Gin 


Glu 


Ser 


Leu Arg 


Lys Gin 


Asn 


Arg 


Phe 


Ser Arg 








500 










505 








510 






Ala 


Ser 


Glu 


Lys 


Met 


Phe 


Asp Arg 


He 


He Ala 


Ala 


Tyr Gly Arg Gly 






515 










520 








525 








Leu 


Ala 


Lys 


Val 


Leu 


Asn 


His 


Pro 


Trp 


Leu Thr 


Leu 


Ser 


Val 


Ala 


Leu 




530 










535 








540 










Ser 


Thr 


Leu 


Leu 


Leu 


Ser 


Val 


Leu 


Leu 


Trp Val 


Phe 


He 


Pro 


Lys 


Gly 


545 










550 








555 








560 


Phe 


Phe 


Pro 


Val 


Gin Asp Asn Gly He 


He Gin Gly Thr Leu Gin Ala 










565 










570 








575 




Pro 


Gin 


Ser 


Ser 


Ser 


Phe 


Ala 


Asn 


Met 


Ala Gin Arg 


Gin Arg 


Gin 


Val 








580 










585 








590 






Ala 


Asp 


Val 


He 


Leu 


Gin 


Asp 


Pro 


Ala 


Val Gin 


Ser 


Leu 


Thr 


Ser 


Phe 






595 










600 








605 








Val 


Gly Val Asp 


Gly Thr 


Asn 


Pro 


Ser 


Leu Asn 


Ser 


Ala Arg 


Leu 


Gin 




610 










615 








620 










lie 


Asn 


Leu Lys 


Pro 


Leu 


Asp 


Glu 


Arg 


Asp Asp Arg 


Val 


Gin 


Lys 


Val 


625 










630 








635 










640 


lie Ala Arg 


Leu 


Gin 


Thr 


Ala 


Val 


Asp 


Lys Val 


Pro 


Gly Val 


Asp 


Leu 










645 










650 








655 




Phe 


Leu 


Gin 


Pro 


Thr 


Gin 


Asp 


Leu 


Thr 


He Asp 


Thr 


Gin 


Val 


Ser Arg 








660 










665 








670 






Thr 


Gin 


Tyr Gin 


Phe 


Thr 


Leu 


Gin 


Ala 


Thr Ser 


Leu Asp Ala 


Leu 


Ser 



675 680 685 
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Thr 


Trp 


Val 


Pro 


Gin 


Leu 


Met 


Glu 


Lys 


Leu 


Gin 


Gin 


Leu 


Pro 


Gin 


Leu 




690 










695 










700 










Ser Asp 


Val 


Ser 


Ser 


Asp 


Trp 


Gin 


Asp 


Lys 


Gly 


Leu 


Val 


Ala 


Tyr 


Val 


705 










710 










715 










720 


Asn 


Val 


Asp 


Arg 


Asp 


Ser 


Ala 


Ser 


Arg 


Leu 


Gly 


He 


Ser 


Met 


Ala 


Asp 










725 










730 










735 




Val 


Asp 


Asn 


Ala 


Leu 


Tyr 


Asn 


Ala 


Phe 


Gly Gin 


Arg 


Leu 


He 


Ser 


Thr 








740 










745 










750 






lie 


Tyr 


Thr 


Gin 


Ala 


Asn 


Gin 


Tyr 


Arg 


Val 


Val 


Leu 


Glu 


His 


Asn 


Thr 






755 










7 60 










765 








Glu 


Asn 


Thr 


Pro 


Gly 


Leu 


Ala 


Ala 


Leu 


Asp 


Thr 


He 


Arg 


Leu 


Thr 


Ser 




770 










775 










780 










Ser 


Asp 


Gly 


Gly 


Val 


Val 


Pro 


Leu 


Ser 


Ser 


He 


Ala 


Lys 


He 


Glu 


Gin 


785 










790 










795 










800 


Arg 


Phe 


Ala 


Pro 


Leu 


Ser 


He 


Asn 


His 


Leu Asp 


Gin 


Phe 


Pro 


Val 


Thr 








805 










810 










815 




Thr 


He 


Ser 


Phe 


Asn 


Val 


Pro 


Asp 


Asn 


Tyr 


Ser 


Leu 


Gly 


Asp 


Ala 


Val 








820 










825 










830 






Gin 


Ala 


He 


Met 


Asp 


Thr 


Glu 


Lys 


Thr 


Leu 


Asn 


Leu 


Pro 


Val 


Asp 


He 






835 










840 










845 








Thr 


Thr 


Gin 


Phe 


Gin 


Gly 


Ser 


Thr 


Leu 


Ala 


Phe 


Gin 


Ser 


Ala 


Leu Gly 




850 










855 










860 










Ser 


Thr 


Val 


Trp 


Leu 


He 


Val 


Ala 


Ala 


Val 


Val 


Ala 


Met 


Tyr 


He 


Val 


865 










870 










875 










880 


Leu 


Gly 


He 


Leu 


Tyr 


Glu 


Ser 


Phe 


He 


His 


Pro 


He 


Thr 


He 


Leu 


Ser 










885 










890 










895 




Thr 


Leu 


Pro 


Thr 


Ala 


Gly 


Val 


Gly 


Ala 


Leu 


Leu 


Ala 


Leu 


Leu 


He 


Ala 








900 










905 










910 






Gly 


Ser 


Glu 


Leu 


Asp 


Val 


He 


Ala 


He 


He 


Gly 


He 


He 


Leu 


Leu 


He 






915 










920 










925 








Gly 


He 


Val 


Lys 


Lys 


Asn 


Ala 


He 


Met 


Met 


He 


Asp 


Phe 


Ala 


Leu 


Ala 




930 










935 










940 










Ala 


Glu 


Arg 


Glu 


Gin 


Gly 


Met 


Ser 


Pro 


Arg Glu 


Ala 


He 


Tyr 


Gin Ala 


945 










950 










955 










960 


Cys 


Leu 


Leu 


Arg 


Phe 


Arg 


Pro 


He 


Leu 


Met 


Thr 


Thr 


Leu 


Ala 


Ala 


Leu 








965 










970 










975 




Leu 


Gly 


Ala 


Leu 


Pro 


Leu 


Met 


Leu 


Ser 


Thr Gly 


Val 


Gly 


Ala 


Glu 


Leu 








980 










985 










990 






Arg Arg 


Pro 


Leu 


Gly 


He 


Gly 


Met 


Val 


Gly Gly 


Leu 


He 


Val 


Ser 


Gin 






995 










1000 








1005 






Val 


Leu 


Thr 


Leu 


Phe 


Thr 


Thr 


Pro 


Val 


He 


Tyr 


Leu 


Leu 


Phe 


Asp 


Arg 




1010 








1015 








1020 








Leu 


Ala 


Leu 


Trp 


Thr 


Lys 


Ser Arg 


Phe 


Ala Arg 


His 


Glu 


Glu 


Glu 


Ala 



1025 1030 1035 1040 



<210> 283 
<211> 1025 
<212> PRT 
<213> E. Coli 

<400> 283 
Met Lys Phe Phe Ala Leu Phe He 

1 * 5 
Leu Ser Val Ala He Thr Leu Cys 
20 

Pro Val Ala Pro Leu Pro Gin Val 

35 40 
Ala Ser Leu Pro Gly Ala Ser Pro 

50 55 
Thr Pro Leu Glu Arg Ser Leu Gly 



Tyr Arg Pro Val Ala Thr He Leu 

10 15 
Gly He Leu Gly Phe Arg Met Leu 
25 30 
Asp Phe Pro Val He He Val Ser 
45 

Glu Thr Met Ala Ser Ser Val Ala 
60 

Arg He Ala Gly Val Ser Glu Met 
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6 5 70 75 80 

Thr Ser Ser Ser Ser Leu Gly Ser Thr Arg lie lie Leu Gin Phe Asp 

85 90 95 

Phe Asp Arg Asp lie Asn Gly Ala Ala Arg Asp Val Gin Ala Ala lie 

100 105 HO 

Asn Ala Ala Gin Ser Leu Leu Pro Ser Gly Met Pro Ser Arg Pro Thr 

115 120 125 

Tyr Arg Lys Ala Asn Pro Ser Asp Ala Pro He Met He Leu Thr Leu 

130 135 140 

Thr Ser Asp Thr Tyr Ser Gin Gly Glu Leu Tyr Asp Phe Ala Ser Thr 
145 150 155 160 

Gin Leu Ala Pro Thr He Ser Gin He Asp Gly Val Gly Asp Val Asp 

165 170 ' 175 

Val Gly Gly Ser Ser Leu Pro Ala Val Arg Val Gly Leu Asn Pro Gin 

180 185 ' 190 

Ala Leu Phe Asn Gin Gly Val Ser Leu Asp Asp Val Arg Thr Ala Val 

195 200 205 

Ser Asn Ala Asn Val Arg Lys Pro Gin Gly Ala Leu Glu Asp Gly Thr 

210 215 220 

His Arg Trp Gin He Gin Thr Asn Asp Glu Leu Lys Thr Ala Ala Glu 
225 230 235 240 

Tyr Gin Pro Leu He He His Tyr Asn Asn Gly Gly Ala Val Arg Leu 

245 250 255 

Gly Asp Val Ala Thr Val Thr Asp Ser Val Gin Asp Val Arg Asn Ala 

260 265 270 

Gly Met Thr Asn Ala Lys Pro Ala He Leu Leu Met He Arg Lys Leu 

275 280 285 

Pro Glu Ala Asn He He Gin Thr Val Asp Ser He Arg Ala Lys Leu 

290 295 300 

Pro Glu Leu Gin Glu Thr He Pro Ala Ala He Asp Leu Gin He Ala 
305 310 315 320 

Gin Asp Arg Ser Pro Thr He Arg Ala Ser Leu Glu Glu Val Glu Gin 

325 330 335 

Thr Leu He He Ser Val Ala Leu Val He Leu Val Val Phe Leu Phe 

340 345 350 

Leu Arg Ser Gly Arg Ala Thr lie He Pro Ala Val Ser Val Pro Val 

355 360 365 

Ser Leu He Gly Thr Phe Ala Ala Met Tyr Leu Cys Gly Phe Ser Leu 

370 375 380 

Asn Asn Leu Ser Leu Met Ala Leu Thr He Ala Thr Gly Phe Val Val 
385 390 395 400 

Asp Asp Ala He Val Val Leu Glu Asn lie Ala Arg His Leu Glu Ala 

405 410 415 

Gly Met Lys Pro Leu Gin Ala Ala Leu Gin Gly Thr Arg Glu Val Gly 

420 425 430 

Phe Thr Val Leu Ser Met Ser Leu Ser Leu Val Ala Val Phe Leu Pro 

435 440 445 

Leu Leu Leu Met Gly Gly Leu Pro Gly Arg Leu Leu Arg Glu Phe Ala 

450 455 460 

Val Thr Leu Ser Val Ala lie Gly lie Ser Leu Leu Val Ser Leu Thr 
465 470 475 480 

Leu Thr Pro Met Met Cys Gly Trp Met Leu Lys Ala Ser Lys Pro Arg 

485 490 495 

Glu Gin Lys Arg Leu Arg Gly Phe Gly Arg Met Leu Val Ala Leu Gin 

500 505 510 

Gin Gly Tyr Gly Lys Ser Leu Lys Trp Val Leu Asn His Thr Arg Leu 

515 520 525 

Val Gly Val Val Leu Leu Gly Thr lie Ala Leu Asn lie Trp Leu Tyr 

530 535 540 

lie Ser He Pro Lys Thr Phe Phe Pro Glu Gin Asd Thr Gly Val Leu 
545 550 555 560 
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Met 


Gly 


Gly He Gin Ala 


Asp 


Gin 


Ser 








565 










Gly Lys 


Leu 


Gin Asp 


Phe 


Met 


Lys 


He 








580 








585 


Asp Asn 


Val 


Thr Gly 


Phe 


Thr 


Gly 


Gly 






595 








600 




Met 


Phe 


He 


Thr Leu 


Lys 


Pro 


Arg 


Asp 




610 








615 






Gin 


lie 


He 


Asp Arg 


Leu 


Arg 


Val 


Lys 


625 








630 








Asn 


Leu 


Phe 


Leu Met 


Ala 


Val 


Gin 


Asp 








645 










Ser 


Asn 


Ala 


Ser Tyr 


Gin 


Tyr 


Thr 


Leu 








660 








665 


Leu Arg 


Glu 


Trp Glu 


Pro 


Lys 


He 


Arg 






675 








680 




Glu 


Leu 


Ala Asp Val 


Asn 


Ser 


Asp 


Gin 




690 








695 






Asn 


Leu 


Val 


Tyr Asp Arg 


Asp 


Thr 


Met 


705 








710 








Gin 


Ala 


Ala 


Asn Ser 


Leu 


Leu 


Asn 


Asn 








725 










Ser 


Thr 


He 


Tyr Gin 


Pro 


Met 


Asn 


Gin 








740 








745 


Asp 


Pro 


Arg 


Tyr Thr 


Gin 


Asp 


He 


Ser 






755 








760 




lie 


Asn 


Asn 


Glu Gly 


Lys 


Ala 


He 


Pro 




770 








775 






Gin 


Pro 


Ala 


Asn Ala 


Pro 


Leu 


Ser 


Val 


785 








790 








Ala 


Ser 


Thr 


He Ser 


Phe 


Asn 


Leu 


Pro 








805 










Ala 


Ser 


Ala 


Ala He Asp 


Arg 


Ala 


Met 








820 








825 


Thr 


Val 


Arg Gly Ser 


Phe 


Ala 


Gly 


Thr 






835 








840 




Met 


Asn 


Ser 


Gin Val 


He 


Leu 


He 


He 




850 








855 






lie 


Val 


Leu 


Gly He 


Leu 


Tyr 


Glu 


Ser 


8 65 








870 








Leu 


Ser 


Thr 


Leu Pro 


Ser 


Ala 


Gly 


Val 








885 










Leu 


Phe 


Asn 


Ala Pro 


Phe 


Ser 


Leu. 


He 








900 








905 


Leu 


He 


Gly 


He Val 


Lys 


Lys 


Asn 


Ala 






915 








920 




Leu 


Glu 


Ala Gin Arg 


His 


Gly 


Asn 


Leu 




930 








935 






Gin 


Ala 


Cys 


Leu Leu 


Arg 


Phe 


Arg 


Pro 


945 








950 








Ala 


Leu 


Phe 


Gly Ala 


Leu 


Pro 


Leu 


Val 








965 










Glu 


Leu 


Arg Gin Pro Leu 


Gly 


He 


Thr 








980 








985 


Ser 


Gin 


Leu 


Leu Thr 


Leu 


Tyr 


Thr 


Thr 



995 1000 



Asp Arg Leu Arg Leu Arg Phe Ser Arg 
1010 1015 

Glu 
1025 
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He 


Ser 


Phe 


Gin 


Ala 


Met 


Arg 


570 










575 


He 


Arg 


Asp 


Asp 


Pro 


Ala 


Val 










590 






Ser 


Arg 


Val 


Asn 


Ser 


Gly 


Met 








605 








Glu 


Arg 


Ser 


Glu 


Thr 


Ala 


Gin 






620 










Leu 


Ala 


Lys 


Glu 


Pro 


Gly 


Ala 




635 










640 


He 


Arg 


Val 


Gly 


Gly 


Arg 


Gin 


650 










655 




Leu 


Ser 


Asp 


Asp 


Leu 


Ala 


Ala 










670 






Lys 


Lys 


Leu 


Ala 


Thr 


Leu 


Pro 








685 








Gin 


Asp 


Asn 


Gly 


Ala 


Glu 


Met 






700 










Ala 


Arg 


Leu 


Gly 


He 


Asp 


Val 




715 










720 


Ala 


Phe 


Gly 


Gin 


Arg 


Gin 


He 


730 










735 




Tyr 


Lys 


Val 


Val 


Met 


Glu 


Val 










750 






Ala 


Leu 


Glu 


Lys 


Met 


Phe 


Val 








765 








Leu 


Ser 


Tyr 


Phe 


Ala 


Lys 


Trp 






780 










Asn 


His 


Gin 


Gly 


Leu 


Ser 


Ala 




795 










800 


Thr 


Gly 


Lys 


Ser 


Leu 


Ser 


Asp 


810 










815 




Thr 


Gin 


Leu 


Gly 


Val 


Pro 


Ser 










830 






Ala 


Gin 


Val 


Phe 


Gin 


Glu 


Thr 








845 








Ala 


Ala 


He 


Ala 


Thr 


Val 


Tyr 






860 








Tyr 


Val 


His 


Pro 


Leu 


Thr 


He 




875 










880 


Gly 


Ala 


Leu 


Leu 


Ala 


Leu 


Glu 


890 










895 




Ala 


Leu 


He 


Gly 


He 


Met 


Leu 










910 






He 


Met 


Met 


Val 


Asp 


Phe 


Ala 








925 








Thr 


Pro 


Gin 


Glu 


Ala 


He 


Phe 






940 










He 


Met 


Met 


Thr 


Thr 


Leu 


Ala 




955 










960 


Leu 


Ser 


Gly 


Gly 


Asp 


Gly 


Ser 


970 










97 5 




He 


Val 


Gly 


Gly 


Leu 


Val 


Met 










990 






Pro 


Val 


Val 


Tyr 


Leu 


Phe 


Phe 








1005 






Lys 


Pro 


Lys 


Gin 


Thr 


Val 


Thr 



1020 
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<210> 284 
<211> 471 
<212> PRT 
<213> E. Coli 



<400> 284 



Met Thr 


Asp 


Leu 


Pro 


Asp 


Ser 


Thr 


Arg 


Trp 


Gin 


Leu 


Trp 


He 


Val 


Ala 








5 










10 










15 




Phe Gly 


Phe 


Phe 
20 


Met 


Gin 


Ser 


Leu 


Asp 
25 


Thr 


Thr 


He 


Val 


Asn 
30 


Thr 


Ala 


Leu Pro 


Ser 
35 


Met 


Ala 


Gin 


Ser 


Leu 
40 


Gly 


Glu 


Ser 


Pro 


Leu 
45 


His 


Met 


His 


Met Val 


lie 


Val 


Ser 


Tyr 


Val 


Leu 


Thr 


Val 


Ala 


Val 


Met 


Leu 


Pro 


Ala 


50 










55 










60 










Ser Gly 


Trp 


Leu 


Ala 


Asp 


Lys 


Val 


Gly 


Val 


Arg 


Asn 


He 


Phe 


Phe 


Thr 


65 








70 










75 










80 


Ala lie 


Val 


Leu 


Phe 


Thr 


Leu 


Gly 


Ser 


Leu 


Phe 


Cys 


Ala 


Leu 


Ser 


Gly 








85 










90 










95 


Thr Leu 


Asn 


Glu 
100 


Leu 


Leu 


Leu 


Ala 


Arg 
105 


Ala 


Leu 


Gin 


Gly 


Val 
110 


Gly 


Gly 


Ala Met 


Met 
115 


Val 


Pro 


Val 


Gly 


Arg 
120 


Leu 


Thr 


Val 


Met 


Lys 
125 


He 


Val 


Pro 


Arg Glu 


Gin 


Tyr 


Met 


Ala 


Ala 


Met 


Thr 


Phe 


Val 


Thr 


Leu 


Pro 


Gly 


Gin 


130 










135 










140 










Val Gly 


Pro 


Leu 


Leu 


Gly 


Pro 


Ala 


Leu 


Gly 


Gly 


Leu 


Leu 


Val 


Glu 


Tyr 


145 








150 










155 










160 


Ala Ser 


Trp 


His 


Trp 
165 


He 


Phe 


Leu 


He 


Asn 
170 


He 


Pro 


Val 


Gly 


He 
175 


He 


Gly Ala 


lie 


Ala 
180 


Thr 


Leu 


Leu 


Leu 


Met 
185 


Pro 


Asn 


Tyr 


Thr 


Met 
190 


Gin 


Thr 


Arg Arg 


Phe 
195 


Asp 


Leu 


Ser 


Gly 


Phe 
200 


Leu 


Leu 


Leu 


Ala 


Val 
205 


Gly 


Met 


Ala 


Val Leu 


Thr 


Leu 


Ala 


Leu 


Asp 


Gly 


Ser 


Lys 


Gly 


Thr 


Gly 


Leu 


Ser 


Pro 


210 










215 










220 










Leu Thr 


He 


Ala 


Gly 


Leu 


Val 


Ala 


Val 


Gly 


Val 


Val 


Ala 


Leu 


Val 


Leu 


225 








230 










235 










240 


Tyr Leu 


Leu 


His 


Ala 
245 


Arg 


Asn 


Asn 


Asn 


Arg 
250 


Ala 


Leu 


Phe 


Ser 


Leu 
255 


Lys 


Leu Phe 


Arg 


Thr 
260 


Arg 


Thr 


Phe 


Ser 


Leu 
265 


Gly 


Leu 


Ala 


Gly 


Ser 
270 


Phe 


Ala 


Gly Arg 


He 
275 


Gly 


Ser 


Giy 


Met 


Leu 

280. 


Pro 


Phe 


Met 


Thr 


Pro 
285 


Val 


Phe 


Leu 


Gin lie 


Gly 


Leu 


Gly 


Phe 


Ser 


Pro 


Phe 


His 


Ala 


Gly 


Leu 


Met 


Met 


He 


290 










295 










300 










Pro Met 


Val 


Leu 


Gly 


Ser 


Met 


Gly 


Met 


Lys 


Arg 


He 


Val 


Val 


Gin 


Val 


305 








310 










315 










320 


Val Asn 


Arg 


Phe 


Gly 
325 


Tyr 


Arg 


Arg 


Val 


Leu 
330 


Val 


Ala 


Thr 


Thr 


Leu 
335 


Gly 


Leu Ser 


Leu 


Val 


Thr 


Leu 


Leu 


Phe 


Met 


Thr 


Thr 


Ala 


Leu 


Leu 


Gly 


Trp 






340 










345 










350 




Tyr Tyr 


Val 
355 


Leu 


Pro 


Phe 


Val 


Leu 
360 


Phe 


Leu 


Gin 


Gly 


Met 
365 


Val 


Asn 


Ser 


Thr Arg 


Phe 


Ser 


Ser 


Met 


Asn 


Thr 


Leu 


Thr 


Leu 


Lys 


Asp 


Leu 


Pro 


Asp 


370 










375 










380 










Asn Leu 


Ala 


Ser 


Ser 


Gly 


Asn 


Ser 


Leu 


Leu 


Ser 


Met 


He 


Met 


Gin 


Leu 


385 








390 










395 










400 


Ser Met 


Ser 


lie 


Gly 
405 


Val 


Thr 


He 


Ala 


Gly 
410 


Leu 


Leu 


Leu 


Gly 


Leu 
415 


Phe 


Gly Ser 


Gin 


His 
420 


Val 


Ser 


Val 


Asp 


Ser 
425 


Gly 


Thr 


Thr 


Gin 


Thr 
430 


Val 


Phe 
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Met Tyr Thr Trp Leu Ser Met Ala Leu He He Ala Leu Pro Ala Phe 

435 440 445 

He Phe Ala Arg Val Pro Asn Asp Thr His Gin Asn Val Ala He Ser 

450 455 460 

Arg Arg Lys Arg Ser Ala Gin 
465 470 



<210> 285 
<211> 344 
<212> PRT 
<213> E. Coli 

<400> 285 

Met Glu He Arg He Met Leu Phe He Leu Met Met Met Val Met Pro 

15 10 15 

Val Ser Tyr Ala Ala Cys Tyr Ser Glu Leu Ser Val Gin His Asn Leu 

20 25 30 

Val Val Gin Gly Asp Phe Ala Leu Thr Gin Thr Gin Met Ala Thr Tyr 

35 40 45 

Glu His Asn Phe Asn Asp Ser Ser Cys Val Ser Thr Asn Thr He Thr 

50 55 60 

Pro Met Ser Pro Ser Asp He He Val Gly Leu Tyr Asn Asp Thr He 
65 70 75 * 80 

Lys Leu Asn Leu His Phe Glu Trp Thr Asn Lys Asn Asn He Thr Leu 

85 90 95 

Ser Asn Asn Gin Thr Ser Phe Thr Ser Gly Tyr Ser Val Thr Val Thr 

100 105 HO 

Pro Ala Ala Ser Asn Ala Lys Val Asn Val Ser Ala Gly Gly Gly Gly 

115 120 125 

Ser Val Met He Asn Gly Val Ala Thr Leu Ser Ser Ala Ser Ser Ser 

130 135 140 

Thr Arg Gly Ser Ala Ala Val Gin Phe Leu Leu Cys Leu Leu Gly Gly 
145 150 155 " 160 

Lys Ser Trp Asp Ala Cys Val Asn Ser Tyr Arg Asn Ala Leu Ala Gin 

165 170 175 

Asn Ala Gly Val Tyr Ser Phe Asn Leu Thr Leu Ser Tyr Asn Pro He 

180 185 190 

Thr Thr Thr Cys Lys Pro Asp Asp Leu Leu He Thr Leu Asp Ser He 

195 200 205 

Pro Val Ser Gin Leu Pro Ala Thr Gly Asn Lys Ala Thr He Asn Ser 

210 215 220 

Lys Gin Gly Asp He He Leu Arg Cys Lys Asn Leu Leu Gly Gin Gin 
225 230 235 240 

Asn Gin Thr Ser Arg Lys Met Gin Val Tyr Leu Ser Ser Ser Asp Leu 

245 250 255 

Leu Thr Asn Ser Asn Thr He Leu Lys Gly Ala Glu Asp Asn Gly Val 

2 60 265 270 

Gly Phe lie Leu Glu Ser Asn Gly Ser Pro Val Thr Leu Leu Asn He 

275 280 285 

Thr Asn Ser Ser Lys Gly Tyr Thr Asn Leu Lys Glu Val Ala Ala Lys 

290 295 300 

Ser Lys Leu Thr Asp Thr Thr Val Ser He Pro lie Thr Ala Ser Tyr 
305 310 315 320 

Tyr Val Tyr Asp Thr Asn Lys Val Lys Ser Gly Ala Leu Glu Ala Thr 

325 330 335 

Ala Leu He Asn Val Lys Tyr Asp 
34 0 



<210> 286 
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<211> 826 
<212> PRT 
<213> E. Coli 



<400> 286 

Met Leu Arg Met Thr Pro Leu Ala Ser Ala lie Val Ala Leu Leu Leu 

15 10 15 

Gly lie Glu Ala Tyr Ala Ala Glu Glu Thr Phe Asp Thr His Phe Met 

20 25 30 

He Gly Gly Met Lys Asp Gin Gin Val Ala Asn He Arg Leu Asp Asp 

35 40 45 

Asn Gin Pro Leu Pro Gly Gin Tyr Asp He Asp He Tyr Val Asn Lys 

50 55 60 

Gin Trp Arg Gly Lys Tyr Glu He He Val Lys Asp Asn Pro Gin Glu 
65 70 75 80 

Thr Cys Leu Ser Arg Glu Val lie Lys Arg Leu Gly He Asn Ser Asp 

85 90 95 

Asn Phe Ala Ser Gly Lys Gin Cys Leu Thr Phe Glu Gin Leu Val Gin 

100 105 HO 

Gly Gly Ser Tyr Thr Trp Asp He Gly Val Phe Arg Leu Asp Phe Ser 

115 120 125 

Val Pro Gin Ala Trp Val Glu Glu Leu Glu Ser Gly Tyr Val Pro Pro 

130 135 140 

Glu Asn Trp Glu Arg Gly He Asn Ala Phe Tyr Thr Ser Tyr Tyr Leu 
145 150 155 160 

Ser Gin Tyr Tyr Ser Asp Tyr Lys Ala Ser Gly Asn Asn Lys Ser Thr 

165 170 * 175 

Tyr Val Arg Phe Asn Ser Gly Leu Asn Leu Leu Gly Trp Gin Leu His 

180 185 190 

Ser Asp Ala Ser Phe Ser Lys Thr Asn Asn Asn Pro Gly Val Trp Lys 

195 200 205 

Ser Asn Thr Leu Tyr Leu Glu Arg Gly Phe Ala Gin Leu Leu Gly Thr 

210 215 220 

Leu Arg Val Gly Asp Met Tyr Thr Ser Ser Asp He Phe Asp Ser Val 
225 230 235 240 

Arg Phe Arg Gly Val Arg Leu Phe Arg Asp Met Gin Met Leu Pro Asn 

245 250 255 

Ser Lys Gin Asn Phe Thr Pro Arg Val Gin Gly He Ala Gin Ser Asn 

260 265 270 

Ala Leu Val Thr He Glu Gin Asn Gly Phe Val Val Tyr Gin Lys Glu 

275 280 285 

Val Pro Pro Gly Pro Phe Ala He Thr Asp Leu Gin Leu Ala Gly Gly 

290 295 300 

Gly Ala Asp Leu Asp Val Ser Val Lys Glu Ala Asp Gly Ser Val Thr 
305 310 315 J 320 

Thr Tyr Leu Val Pro Tyr Ala Ala Val Pro Asn Met Leu Gin Pro Gly 

325 330 335 

Val Ser Lys Tyr Asp Leu Ala Ala Gly Arg Ser His He Glu Gly Ala 

340 345 350 

Ser Lys Gin Ser Asp Phe Val Gin Ala Gly Tyr Gin Tyr Gly Phe Asn 

355 360 365 

Asn Leu Leu Thr Leu Tyr Gly Gly Ser Met Val Ala Asn Asn Tyr Tyr 

370 375 380 

Ala Phe Thr Leu Gly Ala Gly Trp Asn Thr Arg lie Gly Ala lie Ser 
385 390 395 400 

Val Asp Ala Thr Lys Ser His Ser Lys Gin Asp Asn Gly Asp Val Phe 

405 410 ~ 415 

Asp Gly Gin Ser Tyr Gin lie Ala Tyr Asn Lys Phe Val Ser Gin Thr 

420 425 430 

Ser Thr Arg Phe Gly Leu Ala Ala Trp Arg Tyr Ser Ser Arg Asp Tyr 
435 440 445 
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Arg Thr Phe 

450 
Arg Asp Glu 
465 

Phe Gly Arg 

Glu Gly Trp 

Gly Arg Ser 
515 

Leu Arg Arg 

530 
Asn His His 
545 

Asp Trp Gly 

Asn Ser Thr 

Leu Ser Gly 
595 

Leu Ser His 

610 
Thr Trp Asn 
625 

Ser Thr Tyr 

Trp Ser Gly 

Val Met Asn 
675 

Tyr Arg Thr 

690 
Pro Tyr Arg 
705 

Glu Ala Glu 

Ala Val Val 

lie Lys Ala 
755 

Val Asn Asp 

770 
Gin Leu Phe 
785 

lie Asp Lys 
He Asp Glu 



Asn Asp His 

Asn Asp Val 
470 

Lys Asn Ser 

485 
Gly Ser Val 
500 

Gly Ser Ser 

He Ser Tyr 

Glu Glu Lys 
550 

Asp Asp Val 

565 
Thr Phe Asp 
580 

Thr Val Gly 

Gin His Gin 

Ala Pro Val 
630 

Arg Gin Ala 

645 
Gly Val Asn 
660 

Ala Pro Gly 

Thr Asn Arg 

Glu Asn His 
710 

Leu Arg Gly 

725 
Leu Val Asn 
740 

Leu Arg Ala 

He His Gly 

He Arg Thr 
790 

Gin Gin Gly 

805 
Ser Arg Asn 
820 



Val Trp Ala 
455 

Tyr Asp He 

Phe Ser Ala 

Ser Leu Ser 
505 

Lys Asp Tyr 

520 
Thr Leu Ala 
535 

Arg Phe Asn 

Ser Thr Pro 

Asp Gin Gly 
585 

Ser Arg Asp 

600 " 
Gly Asn Glu 
615 

Ala Thr Val 

Gly Ala Ser 

Leu Ala Asn 
665 

He Lys Asp 
680 

Asn Gly Val 
695 

Leu Met Leu 

Asn Arg Lys 

Phe Asp Thr 
745 

Asp Gly Gin 

.760 
His Asn He 
775 

Asn Glu Val 

Leu Ser Cys 

Tyr He Cys 
825 



Asn Asn Lys 
460 

Ala Asp Tyr 
475 

Asn Met Ser 
490 

Thr Leu Trp 

Gin Leu Ser 

Ala Ser Gin 
540 

He Phe He 

555 
Arg Arg Gin 
570 

Phe Ala Ser 

Gin Phe Asn 

Thr Thr Ala 
620 

Asn Gly Ser 
635 

Val Ser Gly 
650 

Arg Leu Ser 

Ala Tyr Val 

Val He Tyr 
700 

Asp Val Ser 

715 
He Ala Ala 
730 

Asp Gin Arg 

Ser Leu Thr 

Gly Val Val 
780 

Pro Pro Ser 

795 
Thr He Thr 
810 
Gin 



Asp Asn Tyr Arg 

Tyr Gin Asn Asp 
480 

Gin Ser Leu Pro 
4 95 

Arg Asp Tyr Trp 
510 

Tyr Ser Asn Asn 
525 

Ala Tyr Asp Glu 

Ser He Pro Phe 
560 

He Tyr Met Ser 
575 

Asn Asn Thr Gly 
590 

Tyr Gly Val Asn 
605 

Gly Ala Asn Leu 

Tyr Ser Gin Ser 
640 

Gly He Val Ala 
655 

Glu Thr Phe Ala 
670 

Asn Gly Gin Lys 
685 

Asp Gly Met Thr 

Gin Ser Asp Ser 
720 

Pro Tyr Arg Gly 
735 

Lys Pro Trp Phe 
750 

Phe Gly Tyr Glu 
765 

Gly Gin Gly Ser 

Val Asn Val Ala 
800 

Phe Gly Lys Glu 
815 



<210> 287 
<211> 239 
<212> PRT 
<213> E. Coli 

<400> 287 

Met Ala Ala He Pro Trp Arg Pro Phe Asn Leu Arg Gly He Lys Met 

1 5 10 15 

Lys Gly Leu Leu Ser Leu Leu He Phe Ser Met Val Leu Pro Ala His 

20 25 30 

Ala Gly He Val He Tyr Gly Thr Arg He He Tyr Pro Ala Glu Asn 
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35 








40 










45 








Lys 


Glu 


Val 


Met 


Val 


Gin 


Leu Met 


Asn 


Gin 


Gly Asn Arg Ser 


Ser 


Leu 




50 










55 








60 










Leu 


Gin 


Ala 


Trp 


He 


Asp 


Asp Gly Asp 


Thr 


Ser 


Leu 


Pro 


Pro 


Glu 


Lys 


65 










70 








75 










80 


lie 


Gin 


Val 


Pro 


Phe 


Met 


Leu Thr 


Pro 


Pro 


Val 


Ala 


Lys 


He 


Gly Ala 










85 








90 










95 




Asn 


Ser 


Gly 


Gin 


Gin 


Val 


Lys He 


Lys 


He 


Met 


Pro 


Asn 


Lys 


Leu 


Pro 








100 








105 










110 






Thr 


Asn 


Lys 


Glu 


Ser 


He 


Phe Tyr 


Leu 


Asn 


Val 


Leu 


Asp 


He 


Pro 


Pro 






115 








120 










125 








Asn 


Ser 


Pro 


Glu 


Gin 


Glu 


Gly Lys 


Asn 


Ala 


Leu 


Lys 


Phe 


Ala 


Met 


Gin 




130 










135 








140 










Asn 


Arg 


He 


Lys 


Leu 


Phe 


Tyr Arg 


Pro 


Ala 


Gly 


He 


Ala 


Pro 


Val 


Asn 


145 










150 








155 










160 


Lys Ala 


Thr 


Phe 


Lys 


Lys 


Leu Leu 


Val 


Asn 


Arg Ser Gly Asn Gly Leu 










165 








170 










175 




Val 


He 


Lys 


Asn 


Asp 


Ser 


Ala Asn 


Trp 


Val 


Thr 


He 


Ser 


Asp 


Val 


Lys 








180 








185 










190 






Ala 


Asn 


Asn 


Val 


Lys 


Val 


Asn Tyr 


Glu 


Thr 


He 


Met 


He 


Ala 


Pro 


Leu 






195 








200 










205 








Glu 


Ser 


Gin 


Ser 


Val 


Asn 


Val Lys 


Ser 


Asn 


Asn 


Ala 


Asn 


Asn 


Trp 


His 




210 










215 








220 










Leu 


Thr 


He 


He 


Asp Asp 


His Gly Asn 


Tyr 


lie 


Ser Asp 


Lys 


He 




225 










230 








235 













<210> 


288 
























<211> 


180 
























<212> 


PRT 
























<213> 


E. Coli 






















<400> 


288 
























Met Lys 


Arg 


Ser 


He 


He 


Ala 


Ala 


Ala 


Val 


Phe 


Ser Ser 


Phe 


Phe 


Met 


1 






5 










10 








15 




Ser Ala 


Gly 


Val 


Phe 


Ala 


Ala Asp Val 


Asp 


Thr 


Gly Thr Leu 


Thr 


He 






20 










25 








30 






Lys Gly 


Asn 


He 


Ala 


Glu 


Ser 


Pro 


Cys 


Lys 


Phe 


Glu Ala 


Gly Gly Asp 




35 










40 








45 








Ser Val 


Ser 


He 


Asn 


Met 


Pro 


Thr 


Val 


Pro 


Thr 


Ser Val 


Phe 


Glu 


Gly 


50 










55 










60 








Lys Ala 


Lys 


Tyr 


Ser 


Thr 


Tyr 


Asp 


Asp 


Ala 


Val 


Gly Val 


Thr 


Ser 


Ser 


65 








70 










75 








80 


Met Leu 


Lys 


He 


Ser 


Cys 


Pro 


Lys 


Glu 


Val 


Ala 


Gly Val 


Lys 


Leu 


Ser 








85 










90 








95 




Leu He 


Thr 


Asn 


Asp Lys 


He 


Thr 


Gly Asn 


Asp 


Lys Ala 


He 


Ala 


Ser 






100 










105 








110 






Ser Asn 


Asp 


Thr 


Val 


Gly 


Tyr 


Tyr 


Leu 


Tyr 


Leu 


Gly Asp Asn 


Ser Asp 




115 










120 








125 








Val Leu 


Asp 


Val 


Ser 


Ala 


Pro 


Phe 


Asn 


He 


Glu 


Ser Tyr 


Lys 


Thr 


Ala 


130 










135 










140 








Glu Gly 


Gin 


Tyr 


Ala 


He 


Pro 


Phe 


Lys 


Ala 


Lys 


Tyr Leu 


Lys 


Leu 


Thr 


145 








150 










155 








160 


Asp Asn 


Ser 


Val 


Gin 


Ser 


Gly Asp 


Val 


Leu 


Ser 


Ser Leu 


Val 


Met 


Arg 








165 










170 








175 




Val Ala 


Gin 


Asp 




























180 
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<211> 112 
<212> PRT 
<213> E. Coli 





<400> 289 










Met 


Ser 


Ser Glu 


Arg Asp 


Leu 


Val 


1 






5 








Asp 


Val 


Ala Lys 


Ala 


Val 


He 


Ala 






20 










Ser 


Leu 


Ala Ser 


Phe 


Ala 


Cys 


Val 






35 








40 


Val 


Gly 


Gly Ala 


lie 


Leu 


Leu 


Thr 




50 








55 




Asn 


Glu 


lie Asp 


Ala 


Gin 


Cys 


His 


65 








70 






lie 


Arg 


Asp Gly 


Leu 


Lys 


Arg 


Gin 








85 








Glu 


Asn 


Met Thr 


Pro 


Phe 


Met 


Tyr 






100 









Asn 


Phe 


Leu Gly Asp 


Phe 


Ser 


Met 




10 






15 




Gly 


Gly Val Ala Thr 


Ala 


He 


Gly 


25 






30 




Ser 


Phe 


Gly Phe Pro 


Val 


He 


Leu 






45 








Gly 


He 


Val Cys Thr 


Val 


Val 


Leu 






60 








Leu 


Ser 


Glu Lys Leu 


Lys 


Tyr 


Ala 






75 






80 


Gin 


Glu 


Leu Asp Lys 


Trp 


Lys 


Arg 




90 






95 




Val 


Leu 


Asn Thr Pro 


Pro 


Val 


He 


105 






110 
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<400> 


2 90 














Met 


Thr 


Asp Tyr 


Leu 


Leu 


Leu Phe Val 


Gly Thr Val Leu Val 


Asn 


Asn 


1 








5 






10 


15 




Phe 


Val 


Leu 


Val 


Lys 


Phe 


Leu Gly Leu 


Cys Pro Phe Met Gly Val 


Ser 








20 






25 


30 






Lys 


Lys 


Leu 


Glu 


Thr 


Ala 


Met Gly Met Gly Leu Ala Thr Thr 


Phe 


Val 






35 








40 


45 






Met 


Thr 


Leu 


Ala 


Ser 


He 


Cys Ala Trp 


Leu He Asp Thr Trp 


He 


Leu 




50 










55 


60 






He 


Pro 


Leu 


Asn 


Leu 


He 


Tyr Leu Arg 


Thr Leu Ala Phe He 


Leu 


Val 


65 










70 




75 




80 


He 


Ala 


Val 


Val 


Val 


Gin 


Phe Thr Glu 


Met Val Val Arg Lys 


Thr 


Ser 










85 






90 


95 




Pro 


Val 


Leu 


Tyr 


Arg 


Leu 


Leu Gly He 


Phe Leu Pro Leu He 


Thr 


Thr 








100 






105 


110 






Asn Cys 


Ala 


Val 


Leu 


Gly 


Val Ala Leu 


Leu Asn lie Asn Leu 


Gly His 






115 








120 


125 






Asn 


Phe 


Leu 


Gin 


Ser 


Ala 


Leu Tyr Gly 


Phe Ser Ala Ala Val 


Gly 


Phe 




130 










135 


140 




Ser 


Leu 


Val 


Met 


Val 


Leu 


Phe Ala Ala 


lie Arg Glu Arg Leu Ala Val 


145 










150 




155 




160 


Ala Asp 


Val 


Pro 


Ala 


Pro 


Phe Arg Gly Asn Ala He Ala Leu 


lie 


Thr 










165 






170 


175 




Ala 


Gly 


Leu 


Met 


Ser 


Leu 


Ala Phe Met 


Gly Phe Ser Gly Leu Val 


Lys 








180 






185 


190 




Leu 
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<400> 291 

Met Asn Ala He Trp He Ala Val Ala Ala Val Ser Leu Leu Gly Leu 

15 10 15 

Ala Phe Gly Ala He Leu Gly Tyr Ala Ser Arg Arg Phe Ala Val Glu 

20 25 30 

Asp Asp Pro Val Val Glu Lys He Asp Glu He Leu Pro Gin Ser Gin 

35 40 45 

Cys Gly Gin Cys Gly Tyr Pro Gly Cys Arg Pro Tyr Ala Glu Ala He 

50 55 60 

Ser Cys Asn Gly Glu Lys He Asn Arg Cys Ala Pro Gly Gly Glu Ala 
65 70 75 " 80 

Val Met Leu Lys He Ala Glu Leu Leu Asn Val Glu Pro Gin Pro Leu 

85 90 95 

Asp Gly Glu Ala Gin Glu He Thr Pro Ala Arg Met Val Ala Val He 

100 105 HO 

Asp Glu Asn Asn Cys He Gly Cys Thr Lys Cys He Gin Ala Cys Pro 

115 120 125 

Val Asp Ala He Val Gly Ala Thr Arg Ala Met His Thr Val Met Ser 

130 135 140 

Asp Leu Cys Thr Gly Cys Asn Leu Cys Val Asp Pro Cys Pro Thr His 
145 150 155 " 160 

Cys He Ser Leu Gin Pro Val Ala Glu Thr Pro Asp Ser Trp Lys Trp 

165 170 * 175 

Asp Leu Asn Thr lie Pro Val Arg He He Pro Val Glu His His Ala 
180 185 190 



<210> 292 
<211> 740 
<212> PRT 
<213> E. Coli 

<400> 292 

Met Leu Lys Leu Phe Ser Ala Phe Arg Lys Asn Lys He Trp Asp Phe 

1 5 10 15 

Asn Gly Gly He His Pro Pro Glu Met Lys Thr Gin Ser Asn Gly Thr 

20 25 30 

Pro Leu Arg Gin Val Pro Leu Ala Gin Arg Phe Val He Pro Leu Lys 

35 40 45 

Gin His He Gly Ala Glu Gly Glu Leu Cys Val Ser Val Gly Asp Lys 

50 55 60 

Val Leu Arg Gly Gin Pro Leu Thr Arg Gly Arg Gly Lys Met Leu Pro 
65 70 75 ~ 80 

Val His Ala Pro Thr Ser Gly Thr Val Thr Ala He Ala Pro His Ser 

85 90 95 

Thr Ala His Pro Ser Ala Leu Ala Glu Leu Ser Val He He Asp Ala 

100 105 HO 

Asp Gly Glu Asp Cys Trp lie Pro Arg Asp Gly Trp Ala Asp Tyr Arg 

115 120 * 125 

Thr Arg Ser Arg Glu Glu Leu He Glu Arg He His Gin Phe Gly Val 

130 135 140 

Ala Gly Leu Gly Gly Ala Gly Phe Pro Thr Gly Val Lys Leu Gin Gly 
145 150 155 160 

Gly Gly Asp Lys lie Glu Thr Leu lie lie Asn Ala Ala Glu Cys Glu 

165 170 175 

Pro Tyr lie Thr Ala Asp Asp Arg Leu Met Gin Asp Cys Ala Ala Gin 

180 185 190 

Val Val Glu Gly lie Arg lie Leu Ala His lie Leu Gin Pro Arg Glu 

195 200 205 

lie Leu lie Gly lie Glu Asp Asn Lys Pro Gin Ala lie Ser Met Leu 
210 215 220 
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Arg Ala Val Leu Ala Asp Ser Asn Asp lie Ser Leu Arg Val lie Pro 
225 230 235 ' 240 

Thr Lys Tyr Pro Ser Gly Gly Ala Lys Gin Leu Thr Tyr He Leu Thr 

245 250 J 255 

Gly Lys Gin Val Pro His Gly Gly Arg Ser Ser Asp He Gly Val Leu 

260 265 270 

Met Gin Asn Val Gly Thr Ala Tyr Ala Val Lys Arg Ala Val He Asp 

275 280 285 

Gly Glu Pro He Thr Glu Arg Val Val Thr Leu Thr Gly Glu Ala He 

290 295 300 

Ala Arg Pro Gly Asn Val Trp Ala Arg Leu Gly Thr Pro Val Arg His 
305 310 315 320 

Leu Leu Asn Asp Ala Gly Phe Cys Pro Ser Ala Asp Gin Met Val He 

325 330 335 

Met Gly Gly Pro Leu Met Gly Phe Thr Leu Pro Trp Leu Asp Val Pro 

340 345 350 

Val Val Lys He Thr Asn Cys Leu Leu Ala Pro Ser Ala Asn Glu Leu 

355 360 365 

Gly Glu Pro Gin Glu Glu Gin Ser Cys lie Arg Cys Ser Ala Cys Ala 

370 375 380 

Asp Ala Cys Pro Ala Asp Leu Leu Pro Gin Gin Leu Tyr Trp Phe Ser 
385 390 395 400 

Lys Gly Gin Gin His Asp Lys Ala Thr Thr His Asn He Ala Asp Cys 

405 410 415 

He Glu Cys Gly Ala Cys Ala Trp Val Cys Pro Ser Asn He Pro Leu 

420 425 430 

Val Gin Tyr Phe Arg Gin Glu Lys Ala Glu He Ala Ala He Arg Gin 

435 440 445 

Glu Glu Lys Arg Ala Ala Glu Ala Lys Ala Arg Phe Glu Ala Arg Gin 

450 455 460 

Ala Arg Leu Glu Arg Glu Lys Ala Ala Arg Leu Glu Arg His Lys Ser 
465 470 475 480 

Ala Ala Val Gin Pro Ala Ala Lys Asp Lys Asp Ala He Ala Ala Ala 

485 490 495 

Leu Ala Arg Val Lys Glu Lys Gin Ala Gin Ala Thr Gin Pro lie Val 

500 505 510 

lie Lys Ala Gly Glu Arg Pro Asp Asn Ser Ala lie lie Ala Ala Arg 

515 520 525 

Glu Ala Arg Lys Ala Gin Ala Arg Ala Lys Gin Ala Glu Leu Gin Gin 

530 535 540 

Thr Asn Asp Ala Ala Thr Val Ala Asp Pro Arg Lys Thr Ala Val Glu 
545 550 555 * 560 

Ala Ala lie Ala Arg Ala Lys Ala Arg Lys Leu Glu Gin Gin Gin Ala 

565 570 575 

Asn Ala Glu Pro Glu Gin Gin Val Asp Pro Arg Lys Ala Ala Val Glu 

580 585 ~ 590 

Ala Ala lie Ala Arg Ala Lys Ala Arg Lys Leu Glu Gin Gin Gin Ala 

595 600 605 

Asn Ala Glu Pro Glu Glu Gin Val Asp Pro Arg Lys Ala Ala Val Glu 

610 615 620 

Ala Ala lie Ala Arg Ala Lys Ala Arg Lys Leu Glu Gin Gin Gin Ala 
625 630 635 640 

Asn Ala Glu Pro Glu Gin Gin Val Asp Pro Arg Lys Ala Ala Val Glu 

645 650 655 

Ala Ala lie Ala Arg Ala Lys Ala Arg Lys Arg Glu Gin Gin Pro Ala 

660 665 670 

Asn Ala Glu Pro Glu Glu Gin Val Asp Pro Arg Lys Ala Ala Val Glu 

675 680 685 

Ala Ala lie Ala Arg Ala Lys Ala Arg Lys Leu Glu Gin Gin Gin Ala 

690 695 700 

Asn Ala Val Pro Glu Glu Gin Val Asp Pro Arg Lys Ala Ala Val Ala 
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705 710 715 720 

Ala Ala lie Ala Arg Ala Gin Ala Lys Lys Ala Ala Gin Gin Lys Val 
725 730 735 

Val Asn Glu Asp 
740 



<210> 293 
<211> 352 
<212> PRT 
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<400> 


293 










Met Val 


Phe 


Arg 


He 


Ala 


Ser 


Ser 


1 






5 








Thr Ser 


Arg 


lie 


Met 


Leu 


Leu 


Val 






20 










Ala Ala 


Gin 


Leu 


Trp 


Phe 


Phe 


Gly 




35 










40 


Leu Ala 


Ser 


Val 


Ser 


Ala 


Leu 


Leu 


50 










55 




Arg Lys 


Gin 


Ser 


Val 


Ala 


Ala 


Thr 


65 








70 






Thr Gly 


Leu 


Leu 


Leu 


Ala 


Val 


Ser 








85 








Met Val 


Val 


Leu 


Gly 


Thr 


Val 


Phe 






100 










Tyr Gly Gly 


Leu 


Gly 


Gin 


Asn 


Pro 




115 










120 


Val Val 


Leu 


Leu 


He 


Ser 


Phe 


Pro 


130 










135 




Pro His 


Glu 


lie 


Ala 


Val 


Asn 


He 


145 








150 






Val lie 


Phe 


Ser 


Gly 


His 


Thr 


Ala 








165 








Arg Leu 


Gly 


lie 


Asp 


Gly 


He 


Ser 






180 










Lys Thr 


Ser 


Val 


Arg 


Ala 


Gly 


His 




195 










200 


Pro lie 


Tyr 


Ser 


Gly 


He 


Leu 


Ala 


210 










215 




Leu Ala 


Trp 


Leu 


Ala 


Gly 


Gly 


Val 


225 








230 






Arg Trp 


His 


He 


Pro 


Leu 


Ser 


Phe 








245 








Met Leu 


Gly 


Trp 


Leu 


Phe 


Ser 


Pro 






260 










His Leu 


Leu 


Ser 


Gly 


Ala 


Thr 


Met 




275 










280 


Asp Pro 


Val 


Thr 


Ala 


Ser 


Thr 


Thr 


290 










295 




Ala Leu 


Ala 


Gly 


Leu 


Leu 


Val 


Trp 


305 








310 






Pro Asp 


Gly 


Val 


Ala 


Phe 


Ala 


Val 








325 








Leu lie 


Asp 


Tyr 


Tyr 


Thr 


Arg 


Pro 



340 



Pro 


Tyr 


Thr 


His 


Asn 


Gin 


Arg 


Gin 




10 










15 




Leu 


Leu 


Ala 


Ala 


Val 


Pro 


Gly 


He 


25 










30 






Trp 


Gly 


Thr 


Leu 


Val 


Gin 


He 


Leu 










45 








Ala 


Glu 


Ala 


Leu 


Val 


Leu 


Lys 


Leu 








60 










Leu 


Lys 


Asp 


Asn 


Ser 


Ala 


Leu 


Leu 






75 










80 


lie 


Pro 


Pro 


Leu 


Ala 


Pro 


Trp 


Trp 




90 










95 




Ala 


Val 


He 


He 


Ala 


Lys 


Gin 


Leu 


105 










110 






Phe 


Asn 


Pro 


Ala 


Met 


He 


Gly 


Tyr 










125 








Val 


Gin 


Met 


Thr 


Ser 


Trp 


Leu 


Pro 








140 










Pro 


Gly 


Phe 


He. 


Asp 


Ala 


He 


Gin 






155 










160 


Ser 


Gly 


Gly 


Asp 


Met 


Asn 


Thr 


Leu 




170 










175 




Gin 


Ala 


Thr 


Pro 


Leu 


Asp 


Thr 


Phe 


185 










190 






Ser 


Val 


Glu 


Gin 


He 


Met 


Gin 


Tyr 










205 








Gly 


Ala 


Gly 


Trp 


Gin 


Trp 


Val 


Asn 








220 










Trp 


Leu 


Leu 


Trp 


Gin 


Lys 


Ala 


lie 






235 










240 


Leu 


val 


Thr 


Leu 


Ala 


Leu 


Cys 


Ala 




250 










255 




Glu 


Thr 


Leu 


Ala 


Ala 


Pro 


Gin 


He 


265 










270 






Leu 


Gly 


Ala 


Phe 


Phe 


He 


Leu 


Thr 










285 








Asn 


Arg 


Gly 


Arg 


Leu 


He 


Phe 


Gly 








300 










Leu 


He 


Arg 


Ser 


Phe 


Gly 


Gly 


Tyr 






315 










320 


Leu 


Leu 


Ala 


Asn 


He 


Thr 


Val 


Pro 




330 










335 




Arg 


Val 


Tyr 


Gly 


His 


Arg 


Lys 


Gly 


345 










350 
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<400> 294 



Met 


Leu 


Lys 


Thr 


He 


Arg 


Lys 


His 


Gly 


lie 


Thr 


Leu 


Ala 


Leu 


Phe 


Ala 


1 








5 










10 










15 




Ala 


Gly 


Ser 


Thr 


Gly 


Leu 


Thr 


Ala 


Ala 


He 


Asn 


Gin 


Met 


Thr 


Lys 


Thr 








20 










25 










30 






Thr 


lie 


Ala 


Glu 


Gin 


Ala 


Ser 


Leu 


Gin 


Gin 


Lys 


Ala 


Leu 


Phe 


Asp 


Gin 






35 










40 










45 






Val 


Leu 


Pro 


Ala 


Glu 


Arg 


Tyr 


Asn 


Asn 


Ala 


Leu 


Ala 


Gin 


Ser 


Cys 


Tyr 




50 










55 










60 






Leu 


Val 


Thr 


Ala 


Pro 


Glu 


Leu 


Gly 


Lys 


Gly 


Glu 


His 


Arg 


Val 


Tyr 


lie 


65 










70 










75 










80 


Ala 


Lys 


Gin 


Asp 


Asp 


Lys 


Pro 


Val 


Ala 


Ala 


Val 


Leu 


Glu 


Ala 


Thr 


Ala 










85 










90 










95 




Pro Asp 
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Gly 


Ala 


He 
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Leu 


Leu 


Val 


Gly 


Ala 


Asp 


Phe 
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Gly 


Thr 


Val 


Leu 


Gly 


Thr 


Arg 


Val 


Thr 


Glu 


His 


His 


Glu 


Thr 


Pro 
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120 
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Gly 


Leu 


Gly 


Asp 


Lys 


He 


Glu 


Leu 


Arg 


Leu 


Ser 


Asp 


Trp 


lie 


Thr 


His 




130 










135 










140 










Phe 


Ala 


Gly 


Lys 


Lys 


He 


Ser 


Gly 


Ala 


Asp 


Asp 


Ala 


His 


Trp 


Ala 


Val 


145 










150 










155 
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Lys 


Lys 


Asp 


Gly 


Gly 


Asp 


rim 


Asp 


oin 


trne 


i nr 


Kj±y 




i nr 


lie 


i nr 
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Pro 


Arg 


Ala 


Val 


Val 


Asn 


Ala 


Val 


Lys 


Arg 


Ala 


Gly 


Leu 


Tyr 


Ala 


Gin 








180 










185 
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Pro 


Ala 


Gin 


Leu 
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Gin 


Leu 


Pro 


Ala 


Cys 


Gly 


Glu 
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205 
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231 




























<212> 
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<213> 


E. Coli 
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295 


























Met 


Ser 


Glu 


He 


Lys 


Asp 


Val 


He 


Val 


Gin 


Gly 


Leu 


Trp 


Lys 


Asn 


Asn 
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Ala 


Leu 


Val 


Gin 


Leu 


Leu 


Gly 


Leu 
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Pro 


Leu 


Leu 


Ala 


Val 


Thr 
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Ser 


Thr 


Ala 


Thr 


Asn 


Ala 


Leu 


Gly 


Leu 


Gly 


Leu 


Ala 


Thr 


Thr 


Leu 


Val 
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Leu 


Thr 


Leu 


Thr 


Asn 


Leu 


Thr 


He 


Ser 


Thr 


Leu 


Arg 


His 


Trp 


Thr 


Pro 
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Ala 


Glu 


lie 


Arg 


He 


Pro 


He 


Tyr 


Val 


Met 


lie 


lie 


Ala 


Ser 


Val 


Val 
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Ser 


Ala 


Val 


Gin 


Met 


Leu 


He 


Asn 


Ala 


Tyr 


Ala 


Phe 


Gly 


Leu 


Tyr 


Gin 
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90 
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Ser 


Leu 


Gly 


He 


Phe 


He 


Pro 


Leu 


lie 


Val 


Thr 


Asn 


Cys 


lie 


Val 


Val 
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105 








110 






Gly Arg 


Ala 


Glu 


Ala 


Phe 


Ala 


Ala 


Lys 


Lys 


Gly 


Pro 


Ala 


Leu 


Ser 


Ala 






115 










120 










125 








Leu Asp 


Gly 


Phe 


Ser 


lie 


Gly 


Met 


Gly 


Ala 


Thr 


Cys 


Ala 


Met 


Phe 


Val 




130 










135 










140 










Leu Gly 


Ser 


Leu 


Arg 


Glu 


He 


He 


Gly 


Asn 


Gly 


Thr 


Leu 


Phe 


Asp 


Gly 


145 










150 










155 










160 


Ala 


Asp 


Ala 


Leu 


Leu 


Gly 


Ser 


Trp 


Ala 


Lys 


Val 


Leu 


Arg 


Val 


Glu 


lie 










165 










170 










175 
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Phe His Thr 


Asp 


Ser 


Pro 


Phe 


Leu 


Leu 


Ala Met Leu 


Pro 


Pro 


Gly Ala 




180 










185 






190 


Phe He Gly 


Leu 


Gly 


Leu 


Met 


Leu 


Ala 


Gly Lys Tyr 


Leu 


He 


Asp Glu 


195 










200 






205 




Arg Met Lys 


Lys 


Arg 


Arg 


Ala 


Glu 


Ala 


Ala Ala Glu 


Arg 


Ala 


Leu Pro 


210 








215 






220 








Asn Gly Glu 


Thr 


Gly 


Asn 


Val 














225 






230 

















<210> 296 
<211> 211 
<212> PRT 
<213> E. Coli 



<400> 296 



Met Asn 


Lys 


Ala 


Lys 


Arg 


Leu 


Glu 


He 


Leu Thr Arg 


Leu 


Arg 


Glu 


Asn 


1 






5 










10 








15 




Asn Pro 


His 


Pro 


Thr 


Thr 


Glu 


Leu 


Asn 


Phe Ser 


Ser 


Pro 


Phe 


Glu 


Leu 






20 










25 








30 






Leu He 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Gin 


Ala Thr Asp 


Val 


Ser 


Val 


Asn 




35 










40 








45 








Lys Ala 


Thr 


Ala 


Lys 


Leu 


Tyr 


Pro 


Val 


Ala Asn 


Thr 


Pro 


Ala 


Ala 


Met 


50 










55 








60 










Leu Glu 


Leu 


Gly 


Val 


Glu 


Gly 


Val 


Lys 


Thr Tyr 


He 


Lys 


Thr 


He 


Gly 


65 








70 








75 










80 


Leu Tyr 


Asn 


Ser 


Lys 


Ala 


Glu 


Asn 


He 


He Lys 


Thr 


Cys 


Arg 


He 


Leu 








85 










90 








95 




Leu Glu 


Gin 


His 


Asn 


Gly 


Glu 


Val 


Pro 


Glu Asp Arg 


Ala 


Ala 


Leu 


Glu 






100 










105 








110 






Ala Leu 


Pro 


Gly 


Val 


Gly 


Arg 


Lys 


Thr 


Ala Asn 


Val 


Val 


Leu 


Asn 


Thr 




115 










120 








125 








Ala Phe 


Gly 


Trp 


Pro 


Thr 


He 


Ala 


Val 


Asp Thr His 


He 


Phe 


Arg 


Val 


130 










135 








140 










Cys Asn 


Arg 


Thr 


Gin 


Phe 


Ala 


Pro 


Gly 


Lys Asn Val 


Glu 


Gin 


Val 


Glu 


145 








150 








155 










160 


Glu Lys 


Leu 


Leu 


Lys 


Val 


Val 


Pro 


Ala 


Glu Phe 


Lys 


Val 


Asp 


Cys 


His 








165 










170 








175 




His Trp 


Leu 


He 


Leu 


His 


Gly 


Arg 


Tyr 


Thr Cys 


He 


Ala 


Arg 


Lys 


Pro 






180 










185 








190 






Arg Cys 


Gly 


Ser 


Cys 


He 


He 


Glu 


Asp 


Leu Cys 


Glu 


Tyr 


Lys 


Glu 


Lys 




195 










200 








205 








Val Asp 


He 


























210 




























<210> 


297 
























<211> 


167 
























<212> 


PRT 
























<213> 


E. Coli 






















<400> 


297 
























Met Lys 


Arg 


Leu 


His 


Lys 


Arg 


Phe 


Leu 


Leu Ala 


Thr 


Phe 


Cys 


Ala 


Leu 


1 






5 










10 








15 




Phe Thr 


Ala 


Thr 


Leu 


Gin 


Ala 


Ala 


Asp 


Val Thr 


He 


Thr 


Val 


Asn 


Gly 






20 










25 








30 




Arg Val 


Val 


Ala 


Lys 


Pro 


Cys 


Thr 


He 


Gin Thr 


Lys 


Glu 


Ala 


Asn 


Val 




35 










40 








45 








Asn Leu 


Gly 


Asp 


Leu 


Tyr 


Thr 


Arg 


Asn 


Leu Gin 


Gin 


Pro 


Gly 


Ser 


Ala 


50 










55 








60 










Ser Gly 


Trp 


His 


Asn 


He 


Thr 


Leu 


Ser 


Leu Thr 


Asp 


Cys 


Pro 


Val 


Glu 
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65 








70 








75 










80 


Thr 


Ser 


Ala 


Val Thr 


Ala 


He 


Val 


Thr 


Gly Ser 


Thr 


Asp Asn 


Thr 


Gly 








85 










90 








95 




Tyr 


Tyr 


Lys 


Asn Glu 


Gly 


Thr 


Ala 


Glu 


Asn He 


Gin 


He 


Glu 


Leu 


Arg 








100 








105 








110 






Asp 


Asp 


Gin 


Asp Ala 


Ala 


Leu 


Lys 


Asn 


Gly Asp 


Ser 


Lys 


Thr 


Val 


He 






115 








120 








125 








Val 


Asp 


Glu 


He Thr 


Arg 


Asn 


Ala 


Gin 


Phe Pro 


Leu 


Lys 


Ala 


Arg Ala 




130 








135 








140 










lie 


Thr 


Val 


Asn Gly 


Asn 


Ala 


Ser Gin Gly Thr He 


Glu 


Ala 


Leu 


He 


145 








150 








155 










160 


Asn 


Val 


He 


Tyr Thr 


Trp Gin 
























165 























<210> 298 
<211> 176 
<212> PRT 
<213> E. Coli 



<400> 298 



Met 


Lys 


Tyr 


Asn 


Asn 


He He 


Phe Leu Gly Leu Cys 


Leu Gly 


Leu Thr 


1 








5 




10 




15 


Thr 


Tyr 


Ser 


Ala 


Leu 


Ser Ala 


Asp Ser Val lie Lys 


lie Ser 


Gly Arg 








20 






25 


30 




Val 


Leu 


Asp 


Tyr 


Gly 


Cys Thr 


Val Ser Ser Asp Ser 


Leu Asn 


Phe Thr 






35 








40 


45 




Val 


Asp 


Leu 


Gin 


Lys 


Asn Ser 


Ala Arg Gin Phe Pro 


Thr Thr Gly Ser 




50 








55 


60 






Thr 


Ser 


Pro 


Ala 


Val 


Pro Phe 


Gin lie Thr Leu Ser 


Glu Cys 


Ser Lys 


65 










70 


75 




80 


Gly 


Thr 


Thr 


Gly 


Val 


Arg Val 


Ala Phe Asn Gly lie 


Glu Asp Ala Glu 










85 




90 




95 


Asn 


Asn 


Thr 


Leu 


Leu 


Lys Leu 


Asp Glu Gly Ser Asn 


Thr Ala 


Ser Gly 








100 






105 


110 




Leu 


Gly 


lie 


Glu 


He 


Leu Asp 


Ala Asn Met Arg Pro 


Val Lys 


Leu Asn 






115 








120 


125 




Asp 


Leu 


His 


Ala 


Gly 


Met Gin 


Trp lie Pro Leu Val 


Pro Glu 


Gin Asn 




130 








135 


140 






Asn 


He 


Leu 


Pro 


Tyr 


Ser Ala 


Arg Leu Lys Ser Thr 


Gin Lys 


Ser Val 


145 










150 


155 




160 


Asn 


Pro 


Gly 


Leu 


Val 


Arg Ala 


Ser Ala Thr Phe Thr 


Leu Glu 


Phe Gin 










165 




170 




175 





<210> 


299 














<211> 


382 














<212> 


PRT 














<213> 


E . Coli 














<400> 


299 












Met 


Ser Gly 


Tyr Thr 


Val 


Lys 


Pro 


Pro Thr Gly Asp 


Thr Asn Glu Gin 


1 




5 








10 


15 


Thr 


Gin Phe 


lie Asp 


Tyr 


Phe 


Asn 


Leu Phe Tyr Ser 


Lys Arg Gly Gin 






20 








25 


30 


Glu 


Gin lie 


Ser lie 


Ser 


Gin 


Gin 


Leu Gly Asn Tyr 


Gly Thr Thr Phe 




35 








40 




45 


Phe 


Ser Ala 


Ser Arg 


Gin 


Ser 


Tyr 


Trp Asn Thr Ser 


Arg Ser Asp Gin 




50 






55 




60 
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Gin lie Ser Phe Gly Leu Asn Val Pro Phe Gly Asp lie Thr Thr Ser 
65 70 75 80 

Leu Asn Tyr Ser Tyr Ser Asn Asn lie Trp Gin Asn Asp Arg Asp His 

85 90 95 

Leu Leu Ala Phe Thr Leu Asn Val Pro Phe Ser His Trp Met Arg Thr 

100 105 HO 

Asp Ser Gin Ser Ala Phe Arg Asn Ser Asn Ala Ser Tyr Ser Met Ser 

115 120 125 

Asn Asp Leu Lys Gly Gly Met Thr Asn Leu Ser Gly Val Tyr Gly Thr 

130 135 140 

Leu Leu Pro Asp Asn Asn Leu Asn Tyr Ser Val Gin Val Gly Asn Thr 
145 150 155 160 

His Gly Gly Asn Thr Ser Ser Gly Thr Ser Gly Tyr Ser Ser Leu Asn 

165 170 ' 175 

Tyr Arg Gly Ala Tyr Gly Asn Thr Asn Val Gly Tyr Ser Arg Ser Gly 

180 185 190 

Asp Ser Ser Gin He Tyr Tyr Gly Met Ser Gly Gly He He Ala His 

195 200 " 205 

Ala Asp Gly He Thr Phe Gly Gin Pro Leu Gly Asp Thr Met Val Leu 

210 215 220 

Val Lys Ala Pro Gly Ala Asp Asn Val Lys lie Glu Asn Gin Thr Gly 
225 230 235 240 

He His Thr Asp Trp Arg Gly Tyr Ala He Leu Pro Phe Ala Thr Glu 

245 250 255 

Tyr Arg Glu Asn Arg Val Ala Leu Asn Ala Asn Ser Leu Ala Asp Asn 

260 265 270 

Val Glu Leu Asp Glu Thr Val Val Thr Val lie Pro Thr His Gly Ala 

275 280 285 

He Ala Arg Ala Thr Phe Asn Ala Gin lie Gly Gly Lys Val Leu Met 

290 295 300 

Thr Leu Lys Tyr Gly Asn Lys Ser Val Pro Phe Gly Ala lie Val Thr 
305 310 315 320 

His Gly Glu Asn Lys Asn Gly Ser lie Val Ala Glu Asn Gly Gin Val 

325 330 335 

Tyr Leu Thr Gly Leu Pro Gin Ser Gly Gin Leu Gin Val Ser Trp Gly 

340 345 350 

Lys Asp Lys Asn Ser Asn Cys lie Val Glu Tyr Lys Leu Pro Glu Val 

355 360 * 365 

Ser Pro Gly Thr Leu Leu Asn Gin Gin Thr Ala lie Cys Arg 
370 375 380 



<210> 300 
<211> 138 
<212> PRT 
<213> E. Coli 





<400> 


300 






















Met 


lie 


Ala 


lie 


Ala 


Asp 


He 


Leu 


Gin 


Ala 


Gly Glu 


Lys 


Leu Thr 


Ala 


1 








5 










10 
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Val 


Ala 


Pro 


Phe 


Leu 


Ala 


Gly 


He 


Gin 


Asn 


Glu Glu 


Gin 


Tyr Thr 


Gin 
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Leu 


Glu 


Leu 


Val 


Asp 


His 


Leu 


Leu 


Leu Asn Asp 


Pro 


Glu Asn 


Pro 
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45 






Leu 


Leu 


Asp 


Leu 


Val 


Cys 


Ala 


Lys 


He 


Thr 


Ala Trp 


Glu 


Glu Ser 


Ala 
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Pro 


Glu 


Phe 


Ala 


Glu 


Phe 


Asn 


Ala 


Met 


Ala 


Gin Ala 


Met 


Pro Gly Gly 


65 










70 
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80 


lie 


Ala 


Val 


lie 


Arg 


Thr 


Leu 


Met 


Asp 


Gin 


Tyr Gly 


Leu 


Thr Leu 


Ser 
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90 






95 




Asp Leu 


Pro 


Glu 


lie 


Gly Ser 


Lys 


Ser 


Met 


Val Ser Arg Val Leu 


Ser 
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100 

Gly Lys Arg Lys 
115 

Phe Gly He Ser 
130 



Leu Thr Leu Glu 
120 

Pro Ala Leu Phe 
135 



105 

His Ala Lys Lys 
He Asp 



110 

Leu Ala Thr Arg 
125 



<210> 301 
<211> 104 
<212> PRT 
<213> E . Coli 



<400> 301 



Met His 


Leu 


He 


Thr 


Gin 


Lys 


Ala 


1 






5 








Pro Gin 


His 


Lys 


Thr 


Glu 


Leu 


Val 






20 










Gly Tyr 


Phe 


Lys 


Lys 


Pro 


Glu 


Ser 




35 










40 


Asp Asn 


Phe 


Lys 


Tyr 


Leu 


Asp 


Lys 


50 










55 




Asn Glu 


Leu 


Arg 


Val 


Val 


Ala 


Met 


65 








70 






Tyr He 


Arg 


Glu 


Val 


Met 


Thr 


His 








85 








Val His 


Arg 


Thr 


Lys 


Gly 


Lys 


Lys 



100 



Leu 


Lys Asp Ala 


Ala 


Glu 


Lys 


Tyr 




10 






15 




Ala 


Leu Gly Asn 


Thr 


He 


Ala 


Lys 


25 






30 




Leu 


Lys Ala Val 


Phe 


Pro 


Ser 


Leu 






45 








His 


Tyr Val Phe 


Asn Val Gly Gly 
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Val 


Phe Phe Glu 


Ser 


Gin 


Lys 


Cys 




75 








80 


Lys 


Glu Tyr Asp 


Phe 


Phe 


Thr 


Ala 




90 






95 





<210> 302 
<211> 2383 
<212> PRT 
<213> E . Coli 

<400> 302 



Met 


Leu 


Ser 


Val 


Phe 


Thr 


Phe 


Phe 


1 








5 








Met 


Leu 


Ala 


Arg 


Ser 


Gly 


Lys 


Val 








20 










Gly Glu 


Glu 


He 


Asn 


Asp 


Arg 


Gin 






35 










40 


Leu Arg 


Arg 


Leu 


Thr 


Ala 


Gly 


He 




50 










55 




Pro 


Met 


Ala 


Ala 


Ala 


Ala 


Gin 


Gly 


65 










70 






Pro 


Val 


Pro 


Ala 


Gin 


He 


Ala 


He 










85 








Thr 


Leu 


Gly 


Ala 


Leu 


Glu 


Ser 


Ala 








100 










He 


Ser 


Val 


Ala 


Glu 


Leu 


Arg 


Lys 






115 










120 


Arg 


Ser 


Phe 


Asp 


Asn 


Val 


Arg 


Gin 




130 










135 




Gin 


Val 


Ser 


Glu 


Lys 


Lys 


Leu 


Thr 


145 










150 






Asn 


Leu 


Glu 


Gin 


Gin 


He 


Ala 


Ser 










165 








Leu 


Ala 


Glu 


Asp 


Met 


Asn 


Ser 


Glu 








180 










Trp Ala 


Ser 


Ser 


Gin 


Ala 


Ser 


Gly 



Arg 


Cys Ala Arg 


Lys 


Gly Ala 


Phe 




10 










15 




Ser 


Met 


Ala 


Thr 


Lys 


Lys 


Arg 


Ser 


25 










30 






He 


Leu 


Cys 


Gly 


Met 
45 


Gly 


He 


Lys 


Cys 


Leu 


He 


Thr 
60 


Gin 


Leu 


Ala 


Phe 


Val 


Val 


Asn 
75 


Ala 


Ala 


Thr 


Gin 


Gin 
80 


Ala 


Asn 


Ala 


Asn 


Thr 


Val 


Pro 


Tyr 




90 










95 


Gin 


Ser 


Val 


Ala 


Glu 


Arg 


Phe 


Gly 


105 










110 




Leu 


Asn 


Gin 


Phe 


Arg 
125 


Thr 


Phe 


Ala 


Gly 


Asp Glu 


Leu 


Asp 


Val 


Pro 


Ala 








140 










Pro 


Pro 


Pro Gly 


Asn 


Ser 


Ser 


Asp 






155 










160 


Thr 


Ser 


Gin 


Gin 


He 


Gly Ser 


Leu 




170 










175 




Gin 


Ala 


Ala 


Asn 


Met Ala Arg Gly 


185 










190 






Ala 


Met 


Thr Asp 


Trp 


Leu Ser Arg 
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195 








200 


205 






Phe Gly Thr 


Ala 


Arg He 


Thr 


Leu 


Gly Val Asp Glu Asp Phe Ser Leu 


210 








215 




220 






Lys Asn 


Ser 


Gin 


Phe Asp 


Phe 


Leu 


His Pro Trp Tyr Glu 


Thr 


Pro Asp 


.225 






230 






235 




240 


Asn Leu 


Phe 


Pne 


Ser Gin 


His 


Thr 


Leu His Arg Thr Asp 


Glu 


Arg Thr 








245 






250 




255 


Gin lie 


Asn 


Asn 


Gly Leu 


Gly 


Trp 


Arg His Phe Thr Pro 


Thr 


Trp Met 






260 








265 


270 


Ser Gly 


He 


Asn 


Phe Phe 


Phe 


Asp 


His Asp Leu Ser Arg Tyr His Ser 




275 








280 


285 






Arg Ala 


Gly 


He 


Gly Ala Glu 


Tyr 


Trp Arg Asp Tyr Leu Lys 


Leu Ser 


290 








295 




300 






Ser Asn 


Gly 


Tyr 


Leu Arg 


Leu 


Thr 


Asn Trp Arg Ser Ala 


Pro 


Glu Leu 


305 






310 






315 




320 


Asp Asn Asp 


Tyr 


Glu Ala Arg 


Pro 


Ala Asn Gly Trp Asp Val Arg Ala 








325 






330 




335 


Glu Ser 


Trp 


Leu 


Pro Ala 


Trp 


Pro 


His Leu Gly Gly Lys 


Leu 


Val Tyr 






340 








345 


350 




Glu Gin 


Tyr 


Tyr 


Gly Asp 


Glu 


Val 


Ala Leu Phe Asp Lys 


Asp Asp Arg 




355 








360 


365 






Gin Ser 


Asn 


Pro 


His Ala 


He 


Thr 


Ala Gly Leu Asn Tyr 


Thr 


Pro Phe 


370 








375 




380 






Pro Leu 


Met 


Thr 


Phe Ser 


Ala 


Glu 


Gin Arg Gin Gly Lys 


Gin 


Gly Glu 


385 






390 






395 




400 


Asn Asp Thr 


Arg 


Phe Ala 


Val 


Asp 


Phe Thr Trp Gin Pro 


Gly 


Ser Ala 








405 






410 




415 


Met Gin 


Lys 


Gin 


Leu Asp 


Pro 


Asn 


Glu Val Ala Ala Arg 


Arg 


Ser Leu 






420 








425 


430 




Ala Gly Ser 


Arg 


Tyr Asp Leu Val 


Asp Arg Asn Asn Asn 


He 


Val Leu 




435 








440 


445 






Glu Tyr Arg 


Lys 


Lys Glu Leu 


Val 


Arg Leu Thr Leu Thr 


Asp 


Pro Val 


450 








455 




460 






Thr Gly Lys 


Ser 


Gly Glu 


Val 


Lys 


Ser Leu Val Ser Ser 


Leu 


Gin Thr 


4 65 






470 






475 




480 


Lys Tyr 


Ala 


Leu 


Lys Gly 


Tyr 


Asn 


Val Glu Ala Thr Ala 


Leu 


Glu Ala 








485 






490 




495 


Ala Gly Gly 


Lys 


Val Val 


Thr 


Thr 


Gly Lys Asp He Leu 


Val 


Thr Leu 






500 








505 


510 




Pro Ala 


Tyr 


Arg 


Phe Thr 


Ser 


Thr 


Pro Glu Thr Asp Asn 


Thr 


Trp Pro 




515 








520 


525 




He Glu 


Val 


Thr 


Ala Glu Asp Val 


Lys Gly Asn Leu Ser 


Asn 


Arg Glu 


530 








535 




540 




Gin Ser 


Met 


Val 


Val Val 


Gin 


Ala 


Pro Thr Leu Ser Gin 


Lys 


Asp Ser 


545 






550 






555 


560 


Ser Val 


Ser 


Leu 


Ser Thr 


Gin 


Thr 


Leu Asn Ala Asp Ser 


His 


Ser Thr 








565 






570 




575 


Ala Thr 


Leu 


Thr 


Phe He 


Ala 


His 


Asp Ala Ala Gly Asn 


Pro 


Val Val 






580 








585 


590 




Gly Leu 


Val 


Leu 


Ser Thr 


Arg 


His 


Glu Gly Val Gin Asp 


He 


Thr Leu 




595 








600 


605 






Ser Asp 


Trp 


Lys 


Asp Asn 


Gly Asp 


Gly Ser Tyr Thr Gin 


He 


Leu Thr 


610 








615 




620 






Thr Gly Ala 


Met 


Ser Gly Thr 


Leu 


Thr Leu Met Pro Gin 


Leu Asn Gly 


625 






630 






635 




640 


Val Asp Ala 


Ala 


Lys Ala 


Pro 


Ala 


Val Val Asn He He 


Ser 


Val Ser 








645 






650 




655 


Ser Ser Arg 


Thr 


His Ser 


Ser 


He 


Lys He Asp Lys Asp Arg 


Tyr Leu 






660 








665 


670 




Ser Gly Asn 


Pro 


He Glu 


Val 


Thr 


Val Glu Leu Arg Asp 


Glu 


Asn Asp 




675 








680 


685 
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Lvs 


Pro 


Val 


Lys 


Glu 


Gin 


Lys 


Gin 


Gin 


Leu 


Asn 


Asn 


Ala 


Val 


Ser 


He 




690 










695 










700 










Asp 


Asn 


Val 


Lys 


Pro 


Gly 


Val 


Thr 


Thr 


Asp 


Trp 


Lys 


Glu 


Thr 


Ala 


Asp 


705 










710 










715 










720 


Gly 


Val 


Tyr 


Lys 


Ala 


Thr 


Tyr 


Thr 


Ala 


Tyr 


Thr 


Lys 


Gly 


Ser 


Gly 


Leu 










725 










730 










735 




Thr 


Ala 


Lys 


Leu 


Leu 


Met 


Gin 


Asn 


Trp 


Asn 


Glu 


Asp 


Leu 


His 


Thr 


Ala 








740 










745 










750 






Gly 


Phe 


He 


He 


Asp 


Ala 


Asn 


Pro 


Gin 


Ser 


Ala 


Lys 


He 


Ala 


Thr 


Leu 






755 










7 60 










765 








Ser 


Ala 


Ser 


Asn 


Asn 


Gly 


Val 


Leu 


Ala 


Asn 


Glu 


Asn 


Ala 


Ala 


Asn 


Thr 




770 










775 










780 










Val 


Ser 


Val 


Asn 


Val 


Ala 


Asp 


Glu 


Gly 


Ser 


Asn 


Pro 


He 


Asn 


Asp 


His 


785 










790 










795 










800 


Thr 


Val 


Thr 


Phe 


Ala 


Val 


Leu 


Ser 


Gly 


Ser 


Ala 


Thr 


Ser 


Phe 


Asn 


Asn 










805 










810 










815 




Gin 


Asn 


Thr 


Ala 


Lys 


Thr 


Asp 


Val 


Asn 


Gly 


Leu 


Ala 


Thr 


Phe 


Asp 


Leu 








820 










825 










830 






Lvs 


Ser 


Ser 


Lvs 


Gin 


Glu 


Asp 


Asn 


Thr 


Val 


Glu 


Val 


Thr 


Leu 


Glu 


Asn 




835 










840 










845 








Glv 


Val 


Lys 


Gin 


Thr 


Leu 


He 


Val 


Ser 


Phe 


Val 


Gly 


Asp 


Ser 


Ser 


Thr 




850 










855 










860 










Ala 


Gin 


Val 


Asp 


Leu 


Gin 


Lys 


Ser 


Lys 


Asn 


Glu 


Val 


Val 


Ala 


Asp 


Gly 


865 










870 










875 










880 


Asn 


Asp 


Ser 


Val 


Thr 


Met 


Thr 


Ala 


Thr 


Val 


Arg 


Asp 


Ala 


Lys 


Gly 


Asn 










885 










890 










895 




Leu 


Leu 


Asn 


Asp 


Val 


Met 


Val 


Thr 


Phe 


Asn 


Val 


Asn 


Ser 


Ala 


Glu 


Ala 








900 










905 










910 






Lvs 


Leu 


Ser 


Gin 


Thr 


Glu 


Val 


Asn 


Ser 


His 


Asp 


Gly 


He 


Ala 


Thr 


Ala 




915 










920 










925 








Thr 


Leu 


Thr 


Ser 


Leu 


Lys 


Asn 


Gly 


Asp 


Tyr 


Arg 


Val 


Thr 


Ala 


Ser 


Val 




930 










935 










940 










Ser 


Ser 


Gly 


Ser 


Gin 


Ala 


Asn 


Gin 


Gin 


Val 


Asn 


Phe 


He 


Gly 


Asp 


Gin 


945 










950 










955 










960 


Ser 


Thr 


Ala 


Ala 


Leu 


Thr 


Leu 


Ser 


Val 


Pro 


Ser 


Gly 


Asp 


He 


Thr 


Val 










965 










970 










975 




Thr 


Asn 


Thr 


Ala 


Pro 


Gin 


Tyr 


Met 


Thr 


Ala 


Thr 


Leu 


Gin 


Asp 


Lys 


Asn 








980 










985 










990 






Gly 


Asn 


Pro 


Leu 


Lys 


Asp 


Lys 


Glu 


He 


Thr 


Phe 


Ser 


Val 


Pro 


Asn 


Asp 




995 










1000 








1005 






Val 


Ala 


Ser 


Lys 


Phe 


Ser 


He 


Ser 


Asn 


Gly 


Gly 


Lys 


Gly Met 


Thr 


Asp 




1010 








1015 








1020 








Ser 


Asn 


Gly 


Val 


Ala 


He 


Ala 


Ser 


Leu 


Thr 


Gly 


Thr 


Leu 


Ala 


Gly 


Thr 


1025 








1030 








1035 








104C 


His 


Met 


He 


Met 


Ala 


Arg 


Leu 


Ala 


Asn 


Ser 


Asn 


Val 


Ser Asp 


Ala 


Gin 










1045 








1050 








1055 


Pro 


Met 


Thr 


Phe 


Val 


Ala Asp 


Lys 


Asp 


Arg Ala 


Val 


Val 


Val 


Leu 


Gin 








1060 








1065 








1070 




Thr 


Ser 


Lys 


Ala 


Glu 


lie 


He 


Gly Asn 


Gly Val 


Asp Glu 


Thr 


Thr 


Leu 






1075 








1080 








1085 






Thr 


Ala 


Thr 


Val 


Lys 


Asp 


Pro 


Ser 


Asn 


His 


Pro 


Val 


Ala 


Gly 


He 


Thr 




1090 








1095 








1100 








Val 


Asn 


Phe 


Thr 


Met 


Pro Gin Asp Val 


Ala 


Ala 


Asn 


Phe 


Thr 


Leu 


Glu 


1105 








1110 








1115 








112C 


Asn 


Asn 


Gly 


He 


Ala 


He 


Thr 


Gin 


Ala 


Asn 


Gly Glu Ala 


His 


Val 


Thr 










1125 








1130 








1135 


Leu 


Lys 


Gly Lys 


Lys Ala Gly Thr His 


Thr 


Val 


Thr 


Ala 


Thr 


Leu Gly 








1140 








1145 








1150 




Asn 


Asn 


Asn 


Thr 


Ser Asp 


Ser 


Gin 


Pro 


Val 


Thr 


Phe 


Val 


Ala 


Asp Lys 






1155 








1160 








1165 






Ala 


Ser 


Ala 


Gin 


Val 


Val 


Leu 


Gin 


He 


Ser 


Lys 


Asp Glu 


He 


Thr Gly 



-135- 



WO 00/44906 



PCTAJSOO/02200 



1170 1175 1180 

Asn Gly Val Asp Ser Ala Thr Leu Thr Ala Thr Val Lys Asp Gin Phe 
1185 1190 1195 1200 

Asp Asn Glu Val Asn Asn Leu Pro Val Thr Phe Ser Ser Ala Ser Ser 

1205 1210 1215 

Gly Leu Thr Leu Thr Pro Gly Val Ser Asn Thr Asn Glu Ser Gly lie 

1220 1225 1230 

Ala Gin Ala Thr Leu Ala Gly Val Ala Phe Gly Glu Lys Thr Val Thr 

1235 1240 1245 

Ala Ser Leu Ala Asn Asn Gly Ala Ser Asp Asn Lys Thr Val His Phe 

1250 1255 1260 

lie Gly Asp Thr Ala Ala Ala Lys lie lie Glu Leu Ala Pro Val Pro 
1265 1270 1275 1280 

Asp Ser lie lie Ala Gly Thr Pro Gin Asn Ser Ser Gly Ser Val lie 

1285 1290 1295 

Thr Ala Thr Val Val Asp Asn Asn Gly Phe Pro Val Lys Gly Val Thr 

1300 1305 1310 

Val Asn Phe Thr Ser Asn Ala Ala Thr Ala Glu Met Thr Asn Gly Gly 

1315 1320 1325 

Gin Ala Val Thr Asn Glu Gin Gly Lys Ala Thr Val Thr Tyr Thr Asn 

1330 1335 1340 

Thr Arg Ser Ser lie Glu Ser Gly Ala Arg Pro Asp Thr Val Glu Ala 
1345 " 1350 1355 1360 

Ser Leu Glu Asn Gly Ser Ser Thr Leu Ser Thr Ser lie Asn Val Asn 

1365 1370 1375 

Ala Asp Ala Ser Thr Ala His Leu Thr Leu Leu Gin Ala Leu Phe Asp 

1380 1385 1390 

Thr Val Ser Ala Gly Glu Thr Thr Ser Leu Tyr lie Glu Val Lys Asp 

1395 1400 1405 

Asn Tyr Gly Asn Gly Val Pro Gin Gin Glu Val Thr Leu Ser Val Ser 

1410 1415 1420 

Pro Ser Glu Gly Val Thr Pro Ser Asn Asn Ala lie Tyr Thr Thr Asn 
1425 1430 1435 1440 

His Asp Gly Asn Phe Tyr Ala Ser Phe Thr Ala Thr Lys Ala Gly Val 

1445 1450 1455 

Tyr Gin Leu Thr Ala Thr Leu Glu Asn Gly Asp Ser Met Gin Gin Thr 

1460 1465 1470 

Val Thr Tyr Val Pro Asn Val Ala Asn Ala Glu He Thr Leu Ala Ala 

1475 1480 1485 

Ser Lys Asp Pro Val He Ala Asp Asn Asn Asp Leu Thr Thr Leu Thr 

1490 1495 1500 

Ala Thr Val Ala Asp Thr Glu Gly Asn Ala He Ala Asn Thr Glu Val 
1505 1510 1515 1520 

Thr Phe Thr Leu Pro Glu Asp Val Lys Ala Asn Phe Thr Leu Ser Asp 

1525 1530 1535 

Gly Gly Lys Val He Thr Asp Ala Glu Gly Lys Ala Lys Val Thr Leu 

1540 1545 1550 

Lys Gly Thr Lys Ala Gly Ala His Thr Val Thr Ala Ser Met Thr Gly 

1555 1560 1565 

Gly Lys Ser Glu Gin Leu Val Val Asn Phe He Ala Asp Thr Leu Thr 

1570 1575 1580 

Ala Gin Val Asn Leu Asn Val Thr Glu Asp Asn Phe He Ala Asn Asn 
1585 1590 1595 1600 

Val Gly Met Thr Arg Leu Gin Ala Thr Val Thr Asp Gly Asn Gly Asn 

1605 1610 1615 

Pro Leu Ala Asn Glu Ala Val Thr Phe Thr Leu Pro Ala Asp Val Ser 

1620 1625 1630 

Ala Ser Phe Thr Leu Gly Gin Gly Gly Ser Ala lie Thr Asp He Asn 

1635 1640 1645 

Gly Lys Ala Glu Val Thr Leu Ser Gly Thr Lys Ser Gly Thr Tyr Pro 
1650 1655 1660 
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Val Thr Val Ser Val Asn Asn Tyr Gly Val Ser Asp Thr Lys Gin Val 
1665 1670 1675 1680 

Thr Leu lie Ala Asp Ala Gly Thr Ala Lys Leu Ala Ser Leu Thr Ser 

1685 1690 1695 

Val Tyr Ser Phe Val Val Ser Thr Thr Glu Gly Ala Thr Met Thr Ala 

1700 1705 1710 

Ser Val Thr Asp Ala Asn Gly Asn Pro Val Glu Gly lie Lys Val Asn 

1715 1720 1725 

Phe Arg Gly Thr Ser Val Thr Leu Ser Ser Thr Ser Val Glu Thr Asp 

1730 1735 1740 

Asp Arg Gly Phe Ala Glu He Leu Val Thr Ser Thr Glu Val Gly Leu 
1745 1750 1755 1760 

Lys Thr Val Ser Ala Ser Leu Ala Asp Lys Pro Thr Glu Val He Ser 

1765 1770 1775 

Arg Leu Leu Asn Ala Ser Ala Asp Val Asn Ser Ala Thr He Thr Ser 

1780 1785 1790 

Leu Glu He Pro Glu Gly Gin Val Met Val Ala Gin Asp Val Ala Val 

1795 1800 1805 

Lys Ala His Val Asn Asp Gin Phe Gly Asn Pro Val Ala His Gin Pro 

1810 1815 1820 

Val Thr Phe Ser Ala Glu Pro Ser Ser Gin Met He He Ser Gin Asn 
1825 1830 1835 1840 

Thr Val Ser Thr Asn Thr Gin Gly Val Ala Glu Val Thr Met Thr Pro 

1845 1850 1855 

Glu Arg Asn Gly Ser Tyr Met Val Lys Ala Ser Leu Pro Asn Gly Ala 

1860 1865 1870 

Ser Leu Glu Lys Gin Leu Glu Ala He Asp Glu Lys Leu Thr Leu Thr 

1875 1880 1885 

Ala Ser Ser Pro Leu He Gly Val Tyr Ala Pro Thr Gly Ala Thr Leu 

1890 1895 1900 

Thr Ala Thr Leu Thr Ser Ala Asn Gly Thr Pro Val Glu Gly Gin Val 
1905 1910 1915 ~ 1920 

He Asn Phe Ser Val Thr Pro Glu Gly Ala Thr Leu Ser Gly Gly Lys 

1925 1930 1935 

Val Arg Thr Asn Ser Ser Gly Gin Ala Pro Val Val Leu Thr Ser Asn 

1940 1945 1950 

Lys Val Gly Thr Tyr Thr Val Thr Ala Ser Phe His Asn Gly Val Thr 

1955 1960 1965 

He Gin Thr Gin Thr Thr Val Lys Val Thr Gly Asn Ser Ser Thr Ala 

1970 1975 1980 

His Val Ala Ser Phe He Ala Asp Pro Ser Thr He Ala Ala Thr Asn 
1985 1990 1995 2000 

Thr Asp Leu Ser Thr Leu Lys Ala Thr Val Glu Asp Gly Ser Gly Asn 

2005 2010 2015 

Leu He Glu Gly Leu Thr Val Tyr Phe Ala Leu Lys Ser Gly Ser Ala 

2020 2025 2030 

Thr Leu Thr Ser Leu Thr Ala Val Thr Asp Gin Asn Gly He Ala Thr 

2035 2040 2045 

Thr Ser Val Lys Gly Ala Met Thr Gly Ser Val Thr Val Ser Ala Val 

2050 2055 2060 

Thr Thr Ala Gly Gly Met Gin Thr Val Asp He Thr Leu Val Ala Gly 
2065 2070 2075 2080 

Pro Ala Asp Thr Ser Gin Ser Val Leu Lys Ser Asn Arg Ser Ser Leu 

2085 2090 2095 

Lys Gly Asp Tyr Thr Asp Ser Ala Glu Leu Arg Leu Val Leu His Asp 

2100 2105 2110 

He Ser Gly Asn Pro lie Lys Val Ser Glu Gly Met Glu Phe Val Gin 

2115 2120 2125 

Ser Gly Thr Asn Val Pro Tyr lie Lys lie Ser Ala lie Asp Tyr Ser 

2130 2135 2140 

Leu Asn lie Asn Gly Asp Tyr Lys Ala Thr Val Thr Gly Gly Gly Glu 
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2145 2150 2155 2160 

Gly lie Ala Thr Leu He Pro Val Leu Asn Gly Val His Gin Ala Gly 

2165 2170 2175 

Leu Ser Thr Thr He Gin Phe Thr Arg Ala Glu Asp Lys He Met Ser 

2180 2185 2190 

Gly Thr Val Ser Val Asn Gly Thr Asp Leu Pro Thr Thr Thr Phe Pro 

2195 2200 2205 

Ser Gin Gly Phe Thr Gly Ala Tyr Tyr Gin Leu Asn Asn Asp Asn Phe 

2210 2215 2220 

Ala Pro Gly Lys Thr Ala Ala Asp Tyr Glu Phe Ser Ser Ser Ala Ser 
2225 2230 2235 2240 

Trp Val Asp Val Asp Ala Thr Gly Lys Val Thr Phe Lys Asn Val Gly 

2245 2250 ' 2255 

Ser Asn Ser Glu Arg He Thr Ala Thr Pro Lys Ser Gly Gly Pro Ser 

2260 2265 2270 

Tyr Val Tyr Glu He Arg Val Lys Ser Trp Trp Val Asn Ala Gly Glu 

2275 2280 2285 

Ala Phe Met He Tyr Ser Leu Ala Glu Asn Phe Cys Ser Ser Asn Gly 

2290 2295 2300 

Tyr Thr Leu Pro Arg Ala Asn Tyr Leu Asn His Cys Ser Ser Arg Gly 
2305 2310 2315 2320 

He Gly Ser Leu Tyr Ser Glu Trp Gly Asp Met Gly His Tyr Thr Thr 

2325 2330 ' 2335 

Asp Ala Gly Phe Gin Ser Asn Met Tyr Trp Ser Ser Ser Pro Ala Asn 

2340 2345 2350 

Ser Ser Glu Gin Tyr Val Val Ser Leu Ala Thr Gly Asp Gin Ser Val 

2355 2360 2365 

Phe Glu Lys Leu Gly Phe Ala Tyr Ala Thr Cys Tyr Lys Asn Leu 
2370 2375 2380 





<210> 


303 








<211> 


61 
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<400> 


303 






Met 


Ser Lys 


Gly 


Ala Leu Tyr 


Glu 


1 






5 




He 


Pro Leu 


Pro 


His Lys His 


He 






20 






Ser 


Lys Asp 


Val 


Gly Tyr Ala 


Tyr 




35 






40 


Leu 


Lys Thr 


His 


Ser Leu Arg 


Leu 



50 55 



Phe Asn Asn Pro Asp Gin Leu Lys 

10 15 
Ala Ser Thr Phe Asn Asp lie Met 
25 30 
Val Ser Leu Leu Tyr Ala Cys Pro 
45 

Asn Pro Phe Ser Lys 
60 





<210> 


304 








<211> 


398 








<212> 


PRT 








<213> 


E . Coli 








<400> 


304 






Met 


Gin Val 


Ala Glu Gin 


Arg 


He 


1 




5 






Ala 


Val Ala 


Thr Gin Asp 


Gly 


Pro 






20 






Glu 


Arg Gin 


Lys Met Ser 


Ala 


Glu 




35 






40 


Asn 


Asp Pro 


Ala Ala Gly 


Thr 


Thr 




50 




55 





Gin Leu Ala Glu Ala Gin Ala Lys 

10 15 
Gin He Asp Phe Ser Ala Asp Met 
25 30 
Gly Leu Met Gly Pro Phe Ala Leu 
45 

Gly Pro Trp Tyr Thr Asn Gly Thr 
60 



-138- 



WO 00/44906 



PCT/USOO/02200 



Phe Gly 


Leu 


Thr 


Ala 


Gly Trp 


His Leu Asp He 


Trp Gly Lys 


Asn 


Ara 


65 










70 




75 








80 


Ala 


Glu 


Val 


Thr 


Ala 


Arg 


Leu 


Gly Thr Val Lys 


Ala Arg Ala 


Ala 


Glu 










85 






90 






95 




Arg 


Glu 


Gin 


Thr 


Arg 


Gin 


Leu 


Leu Ala Gly Ser 


Val Ala 


Arg 


Leu 


Tyr 








100 








105 




110 




Trp Glu 


Trp Gin 


Thr 


Gin 


Ala 


Ala Leu Asn Thr 


Val Leu 


Gin 


Gin 


lie 






115 










120 


125 








Glu 


Lys 


Glu 


Gin 


Asn 


Thr 


He 


He Ala Thr Asp 


Arg Gin 


Leu 


Tyr 


Gin 




130 










135 




140 








Asn 


Gly 


He 


Thr 


Ser 


Ser 


Val 


Glu Gly Val Glu 


Thr Asp 


lie 


Asn 


Ala 


145 










150 




155 








160 


Ser 


Lys 


Thr Arg 


Gin 


Gin 


Leu 


Asn Asp Val Ala 


Gly Lys 


Met 


Lys 


lie 










165 






170 






175 




lie 


Glu 


Ala 


Arg 


Leu 


Ser 


Ala 


Leu Thr Asn Asn 


Gin Thr 


Lys 


Ser 


Leu 








180 








185 




190 






Lys 


Leu 


Lys 


Pro 


Val 


Ala 


Leu 


Pro Lys Val Ala 


Ser Gin 


Leu 


Pro 


Asp 






195 










200 


205 






Glu 


Leu 


Gly Tyr 


Ser 


Leu 


Leu 


Ala Arg Arg Ala 


Asp Leu 


Gin 


Ala 


Ala 




210 










215 




220 








His 


Trp 


Tyr 


Val 


Glu 


Ser 


Ser 


Leu Ser Thr He 


Asp Ala Ala 


Lys 


Ala 


225 










230 




235 








240 


Ala 


Phe 


Tyr 


Pro 


Asp 


lie 


Asn 


Leu Met Ala Phe 


Leu Gin 


Gin 


Asp 


Ala 










245 






250 






255 




Leu 


His 


Leu 


Ser 


Asp 


Leu 


Phe 


Arg His Ser Ala 


Gin Gin 


Met 


Gly 


Val 








260 








265 




270 






Thr 


Ala 


Gly 


Leu 


Thr 


Leu 


Pro 


He Phe Asp Ser 


Gly Arg Leu 


Asn 


Ala 






275 










280 


285 








Asn 


Leu 


Asp 


He 


Ala 


Lys Ala 


Glu Ser Asn Leu 


Ser He 


Ala 


Ser 


Tyr 




290 










295 




300 






Asn 


Lys 


Ala 


Val 


Val 


Glu 


Ala 


Val Asn Asp Val 


Ala Arg Ala 


Ala 


Ser 


305 










310 




315 








320 


Gin 


Val 


Gin 


Thr 


Leu 


Ala 


Glu 


Lys Asn Gin His 


Gin Ala 


Gin 


lie 


Glu 










325 






330 






335 




Arg Asp 


Ala 


Leu 


Arg 


Val 


Val 


Gly Leu Ala Gin 


Ala Arg 


Phe 


Asn 


Ala 








340 








345 




350 






Gly lie 


He 


Ala 


Gly 


Ser Arg 


Val Ser Glu Ala 


Arg He 


Pro 


Ala 


Leu 






355 










360 


365 








Arg 


Glu 


Arg Ala 


Asn 


Gly 


Leu 


Leu Leu Gin Gly 


Gin Trp 


Leu 


Asp 


Ala 




370 










375 




380 








Ser 


He 


Gin 


Leu 


Thr 


Gly Ala 


Leu Gly Gly Gly 


Tyr Lys 


Arg 






385 










390 




395 
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<400> 305 



Met 


Tyr 


Cys 


His 


Ala 


Lys 


Leu 


Lys 


Asn 


lie 


Ser 


Gin 


His 


Thr 


Val 


lie 


1 








5 










10 










15 




Ser 


Ala 


His 


Leu 


Phe 


Leu 


Pro 


Asp 


Tyr 


Ser 


Pro 


Met 


Asn Arg 


Asp 


Ser 








20 










25 










30 






Phe 


Tyr 


Pro 


Ala 


lie 


Ala 


Cys 


Phe 


Pro 


Leu 


Leu 


Leu 


Met 


Leu 


Ala 


Gly 






35 










40 










45 






Cys 


Ala 
50 


Pro 


Met 


His 


Glu 


Thr 
55 


Arg 


Gin 


Ala 


Leu 


Ser 
60 


Gin 


Gin 


Thr 


Pro 


Ala 


Ala 


Gin 


Val 


Asp 


Thr 


Ala 


Leu 


Pro 


Thr 


Ala 


Leu 


Lys 


Met 


Val 


Gly 


65 










70 










75 










80 


Gin 


Thr 


Ala 


Asn 


Gly 


Gly 


Trp 


Ser 


lie 


Thr 


lie 


lie 


Asn 


Ser 


Leu 


Pro 
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<210> 306 
<211> 315 
<212> PRT 
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<400> 


306 




















Met Arg 


Val 


Leu 


Leu 


Ala 


Pro 


Met 


Glu Gly Val 


Leu Asp 


Ser Leu 


Val 


1 






5 








10 






15 




Arg Giu 


Leu 


Leu 


Thr 


Glu 


Val 


Asn 


Asp Tyr Asp 


Leu 


Cys 


He Thr 


Glu 






20 










25 






30 




Phe Val 


Arg 


Val 


Val 


Asp 


Gin 


Leu 


Leu Pro Val 


Lys 


Val 


Phe His 


Arg 




35 










40 






45 




lie Cys 


Pro 


Glu 


Leu 


Gin 


Asn 


Ala 


Ser Arg Thr 


Pro 


Ser 


Gly Thr 


Leu 


50 










55 






60 








Val Arg 


Val 


Gin 


Leu 


Leu 


Gly Gin 


Phe Pro Gin 


Trp 


Leu 


Ala Glu 


Asn 


65 








70 






75 








80 


Ala Ala 


Arg 


Ala 


Val 


Glu 


Leu 


Gly 


Ser Trp Gly 


Val 


Asp 


Leu Asn 


Cys 








85 








90 






95 




Gly Cys 


Pro 


Ser 


Lys 


Thr 


Val 


Asn 


Gly Ser Gly 


Gly Gly Ala Thr 


Leu 






100 










105 






110 




Leu Lys 


Asp 


Pro 


Glu 


Leu 


He 


Tyr Gin Gly Ala 


Lys 


Ala 


Met Arg 


Glu 




115 










120 






125 






Ala Val 


Pro 


Ala 


His 


Leu 


Pro 


Val 


Ser Val Lys 


Val Arg Leu Gly 


Trp 


130 










135 






140 








Asp Ser 


Gly 


Glu 


Lys 


Lys 


Phe 


Glu 


He Ala Asp 


Ala 


Val 


Gin Gin 


Ala 


145 








150 






155 








160 


Gly Ala 


Thr 


Glu 


Leu 


Val 


Val 


His Gly Arg Thr 


Lys 


Glu 


Gin Gly 


Tyr 








165 








170 






175 




Arg Ala 


Glu 


His 


He 


Asp 


Trp 


Gin 


Ala He Gly 


Asp 


He Arg Gin 


Arg 






180 










185 






190 




Leu Asn 


He 


Pro 


Val 


He 


Ala 


Asn 


Gly Glu He 


Trp Asp 


Trp Gin 


Ser 




195 










200 






205 






Ala Gin 


Gin 


Cys 


Met 


Ala 


He 


Ser 


Gly Cys Asp 


Ala 


Val 


Met He 


Gly 


210 










215 






220 








Arg Gly Ala 


Leu 


Asn 


He 


Pro 


Asn 


Leu Ser Arg 


Val 


Val 


Lys Tyr 


Asn 


225 








230 






235 








240 


Glu Pro Arg 


Met 


Pro 


Trp 


Pro 


Glu 


Val Val Ala 


Leu 


Leu 


Gin Lys 


Tyr 








245 








250 






255 




Thr Arg 


Leu 


Glu 


Lys 


Gin 


Gly Asp Thr Gly Leu 


Tyr 


His 


Val Ala 


Arg 






260 










265 






270 




lie Lys 


Gin 
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Ala Met Ala Arg Thr Val Gly Trp 
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Met Lys Val Thr Leu Pro Glu Phe Glu Arg Ala Gly Val Met Val Val 
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Gly Asp Val Met Leu Asp Arg Tyr Trp Tyr Gly Pro Thr Ser Arg He 
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Ser Pro Glu Ala Pro Val Pro Val Val Lys Val Asn Thr He Glu Glu 
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Arg Pro Gly Gly Ala Ala Asn Val Ala Met Asn He Ala Ser Leu Gly 

50 55 60 

Ala Asn Ala Arg Leu Val Gly Leu Thr Gly He Asp Asp Ala Ala Arg 
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Ala Leu Ser Lys Ser Leu Ala Asp Val Asn Val Lys Cys Asp Phe Val 
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Ser Val Pro Thr His Pro Thr He Thr Lys Leu Arg Val Leu Ser Arg 
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180 185 190 
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Pro He Glu Leu Glu Asn Ala Val Arg Gly Arg Ala Asp Thr Gly Phe 
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Gly Val Met Thr Glu Glu Glu Leu Lys Leu Ala Val Ala Ala Ala Arg 
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Lys Arg Gly Glu Lys Val Val Met Thr Asn Gly Val Phe Asp He Leu 
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His Ala Gly His Val Ser Tyr Leu Ala Asn Ala Arg Lys Leu Gly Asp 
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Arg Leu lie Val Ala Val Asn Ser Asp Ala Ser Thr Lys Arg Leu Lys 
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Gin Val Ala Gly Phe Ser Thr Asp Tyr Leu He Asp Leu Val Thr Arg 

100 105 110 

Phe He Asn Trp Gin Met He Gly Ala He Phe Val Leu Leu Val Ala 

115 120 125 

Trp Leu Phe Leu Ser Gin Trp He Arg He Thr Val Phe Val Val Ala 

130 135 140 

He Leu Leu Trp Leu Asn Val Leu Thr Leu Ala Gly Pro Ser Phe Ser 
145 150 155 160 

Leu Trp Pro Ala Gly Gin Pro Thr Thr Thr Val Thr Thr Thr Gly Gly 

165 170 175 

Asn Ala Ala Ala Thr Val Ala Ala Thr Gly Gly Ala Pro Val Val Gly 

180 185 190 

Asp Met Pro Ala Gin Thr Ala Pro Pro Thr Thr Ala Asn Leu Asn Ala 

195 200 205 

Trp Leu Asn Asn Phe Tyr Asn Ala Glu Ala Lys Arg Lys Ser Thr Phe 

210 215 220 

Pro Ser Ser Leu Pro Ala Asp Ala Gin Pro Phe Glu Leu Leu Val He 
225 230 235 240 

Asn He Cys Ser Leu Ser Trp Ser Asp He Glu Ala Ala Gly Leu Met 

245 250 255 

Ser His Pro Leu Trp Ser His Phe Asp He Glu Phe Lys Asn Phe Asn 

260 265 270 

Ser Ala Thr Ser Tyr Ser Gly Pro Ala Ala He Arg Leu Leu Arg Ala 

275 280 285 

Ser Cys Gly Gin Thr Ser His Thr Asn Leu Tyr Gin Pro Ala Asn Asn 

290 295 300 

Asp Cys Tyr Leu Phe Asp Asn Leu Ser Lys Leu Gly Phe Thr Gin His 
305 310 315 320 

Leu Met Met Gly His Asn Gly Gin Phe Gly Gly Phe Leu Lys Glu Val 
325 330 335 
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Arg 


Glu 


Asn Gly 


Gly Met Gin 


Ser 


Glu 






340 
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fro 


vai 


He Leu 


Leu 


Gly Phe 


Asp 


Gly 






355 . 






360 




Ala 


val 


Leu Asn 


Arg 


Trp Leu 


Asp 


Val 




*3 "7 A 

370 






375 






TV «•» 

Arg 


Ser 


Ala Thr 


Phe 


Tyr Asn 


Thr 


Leu 
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Tyr 


Pro 


Gly Val 


Ser 


Lys Thr 


Ala 


Asp 








405 








Phe 


Phe 


Asp Glu 


Leu Asp Ala 


Phe 


Phe 






420 








425 


Arg 


Lys 


Val Met 


Val 


Val Val 


Val 


Pro 






435 






440 




Gly Asp 


Arg Met 


Gin 


Val Ser 


Gly 


Leu 




450 






455 






He 


Thr 


Asp Val 


Pro 


Val Gly 


Val 


Lys 


4 65 








470 






His 


Gin 


Gly Ala 


Pro 


He Val 


He 


Glu 








485 








He 


Ser 


Asp Leu 


Val Val Arg Val 


Leu 






500 








505 


Asp Asn 


Val Asp 


Trp 


Lys Lys 


Leu 


Thr 






515 






520 




Pro 


Val 


Ser Glu 


Asn 


Ser Asn 


Ala 


Val 




530 






535 






Pro 


Tyr 


Val Arg 


Leu 


Asn Gly Gly Asp 


545 








550 







<210> 332 
<211> 127 
<212> PRT 
<213> E. Coli 





<400> 332 








Met 


Glu 


Gly Ser 


Arg Met Lys 


Tyr Arg 


1 






5 






Phe 


Ala 


Leu Ser 


Ala Gly Ser 


Tyr 


Ala 






20 






25 


Glu 


Gin 


Asn lie 


Leu Lys Glu 


lie 


Ser 






35 




40 




Gin 


Asn 


Arg lie 


Asp Gly Leu 


Asn 


Lys 




50 




55 






Asn Cys 


Ser Asp 


Ser Gin Leu 


Arg 


Ala 


65 






70 






Lys 


Gin 


Lys Asp 


Glu Val Ala 


Glu 


Arg 








85 






Lys 


Gin 


Lys Gly 


Asp Ala Asp 


Lys 


lie 






100 






105 


Ala 


Glu 


Ala Gin 


Glu Glu Leu 


Lys 


Lys 






115 




120 





<210> 333 
<211> 101 
<212> PRT 
<213> E . Coli 

<400> 333 
Met Ser Lys Glu His Thr Thr Glu His 
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Leu Met Asp Gin Thr Asn Leu 
350 

Ser Pro Val Tyr Asp Asp Thr 
365 

Thr Glu Lys Asp Lys Asn Ser 
380 

Pro Leu His Asp Gly Asn His 
395 400 
Tyr Lys Ala Arg Ala Gin Lys 
410 415 
Thr Glu Leu Glu Lys Ser Gly 
430 

Glu His Gly Gly Ala Leu Lys 
445 

Arg Asp lie Pro Ser Pro Ser 
4 60 

Phe Phe Gly Met Lys Ala Pro 
475 480 
Gin Pro Ser Ser Phe Leu Ala 
490 495 
Asp Gly Lys lie Phe Thr Glu 
510 

Ser Gly Leu Pro Gin Thr Ala 
525 

Val lie Gin Tyr Gin Asp Lys 
540 

Trp Val Pro Tyr Pro Gin 
555 



He Ala Leu Ala Val Ser Leu 
10 15 
Thr Thr Leu Cys Gin Glu Lys 
30 

Tyr Ala Glu Lys His Gin Asn 
45 

Ala Leu Ser Glu Val Arg Ala 
60 

Asp His Gin Lys Lys lie Ala 

75 80 
Gin Gin Asp Leu Ala Glu Ala 
90 95 
Ala Lys Arg Glu Arg Lys Leu 
110 

Leu Glu Ala Arg Asp Tyr 
125 



Leu Arg Ala Glu Leu Lys Ser 
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Leu Ser 


Asp Thr 


Leu 


Glu 


Glu Val Leu 


Ser Ser Ser 


Gly Glu 


Lys 


Ser 
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Lys Glu 


Glu Leu 
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He Arg Ser 


Lys Ala Glu 
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Leu 


Lys 
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40 




45 




Gin Ser 


Arg Tyr 


Arg 


Leu 


Gly Glu Thr 


Gly Asp Ala 


He Ala 


Lys 


Gin 


50 








55 


60 






Thr Arg 


Val Ala 


Ala Ala Arg Ala Asp 


Glu Tyr Val Arg Glu Asn Pro 


65 






70 




75 






80 


Trp Thr 


Gly Val 


Gly 
85 


He 


Gly Ala Ala 


He Gly Val 
90 


Val Leu 


Gly 
95 


Val 


Leu Leu 


Ser Arg 
100 


Arg 
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Met Ala Asp Thr 


His 


His 


Ala 


Gin 


Gly 


Pro Gly Lys Ser Val Leu Gly 


1 
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He Gly Gin Arg 


He 


Val 
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He 


Met 


Val Glu Met Val 


Glu Thr Arg 
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Leu Arg 


Leu Ala 


Val 


Val 


Glu 


Leu 


Glu 


Glu Glu Lys Ala 


Asn Leu Phe 
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Gin Leu 
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Met 
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Gly Leu 


Thr 


Met Leu Phe Ala 


Ala Phe Gly 
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Leu Met 


Ser Leu 


Met 


Val 


Leu 


lie 


He 


Trp Ala Val Asp Pro Gin Tyr 
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Arg Leu 
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Met 


He 


Ala 
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Val Val Leu Leu 


Leu Leu Ala 
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Leu He 
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He 


Trp 
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Leu Arg 


Lys Ser Arg Lys 


Ser Thr Leu 
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Leu Arg 


His Thr 


Arg 
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Glu 
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Asn Asp Arg Gin 


Leu Leu Glu 
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Ser Arg 


Glu 
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Ser 


Ser 


Lys 


Val Glu Arg Glu Arg 


Arg 


Lys Ala Gin Leu 


Leu Ser 
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Gin 


He 


Gin 


Gin 


Gin Arg Leu Asp Leu 


Ser 


Ala Ser Arg Arg Glu Trp 
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Leu 


Glu 
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Gly Ala Tyr Asp Arg 


Arg Trp Asn Met Leu 


Leu Ser 
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Leu 


Arg 


Ser 


Trp 


Ala Leu Val Gly Ser 


Ser 


Val Met Ala He 


Trp Thr 
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He 


Arg 


His 
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Asn Met Leu Val Arg 
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Ala Arg Arg Gly 


Phe Gly 
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Val 


Trp 


Ser 


Ala 


Trp Arg Leu Val Lys 


Thr 


Thr Leu Lys Gin 


Gin Gin 
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95 


Leu Arg 


Gly 
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<210> 336 
<211> 160 
<212> PRT 
<213> E. Coli 



<400> 336 
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Asp 
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He Arg 
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Met 


Asp 


Val 


Leu Thr Gly Leu 


Pro Gly Arg Arg Val 


Leu Asp 




130 
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Gin Leu Arg Asn Ala 
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Pro 


Leu Asn Leu Tyr 


145 










150 
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Met 
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Asp 


He Asp Arg Phe 
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Leu 


Val 


Asn Asp Thr Tyr 
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Gly His 


Leu 


lie 


Gly Asp Val Val 








180 




Ala 


Ser 


Trp 


Thr 


Arg Asp Tyr Glu 






195 




200 


Glu 


Phe 


He 


He 


He Val Lys Ala 




210 






215 


Ala 


Gly 


Val 


Arg 


He Cys Gin Leu 


225 








230 


Ser 


Glu 


Gly 


His 


He Asn He Thr 










245 


Phe 


Pro 


Glu 


Glu 


Pro Leu Asp Val 








260 


Met 


Tyr 


Glu 


Gly 


Lys Gin Thr Gly 






275 




280 


Glu 


Gin 


Asn 


Val 


He Asn Arg Val 




290 






295 



Leu Arg Thr Leu Ala Thr Tyr Leu 
185 190 
Thr Val Tyr Arg Tyr Gly Gly Glu 
205 

Ala Asn Asp Glu Glu Ala Cys Arg 
220 

Val Asp Asn His Ala He Thr His 
235 240 
Val Thr Ala Gly Val Ser Arg Ala 

250 255 
Val He Gly Arg Ala Asp Arg Ala 
265 270 
Arg Asn Arg Cys Met Phe He Asp 
285 



<210> 338 
<211> 203 
<212> PRT 
<213> E. Coli 

<400> 338 

Met Arg Leu Arg Val Val Pro Gly Phe He Ser Pro Pro Pro Gly Phe 

1 5 10 15 

Gly Gly Leu Gly Tyr Thr Pro Thr Ala Arg Ala Cys Val Asn He Ser 

20 25 30 

lie Pro Leu Gin Leu Arg Val He Asp Met Leu Asp Val Phe Thr Pro 

35 40 45 

Leu Leu Lys Leu Phe Ala Asn Glu Pro Leu Glu Arg Leu Met Tyr Thr 

50 55 60 

He He He Phe Gly Leu Thr Leu Trp Leu He Pro Lys Glu Phe Thr 
65 70 75 80 

Val Ala Phe Asn Ala Tyr Thr Glu lie Pro Trp Leu Phe Gin lie lie 

85 90 95 

Val Phe Ala Phe Ser Phe Val Val Ala lie Ser Phe Ser Arg Leu Arg 

100 105 no 

Ala His He Gin Lys His Tyr Ser Leu Leu Pro Glu Gin Arg Val Leu 

115 120 125 

Leu Arg Leu Ser Glu Lys Glu lie Ala Val Phe Lys Asp Phe Leu Lys 

130 135 140 

Thr Gly Asn Leu lie lie Thr Ser Pro Cys Arg Asn Pro Val Met Lys 
145 150 155 160 

Lys Leu Glu Arg Lys Gly lie lie Gin His Gin Ser Asp Ser Ala Asn 

165 170 175 

Cys Ser Tyr Tyr Leu Val Thr Glu Lys Tyr Ser His Phe Met Lys Leu 

180 185 190 

Phe Trp Asn Ser Arg Ser Arg Arg Phe Asn Arg 
195 200 



<210> 339 
<211> 58 
<212> PRT 
<213> E . Coli 

<400> 339 

Met Leu Leu Gin Pro Ser Ala Arg Thr Ser Phe Gly Phe Lys Cys Phe 
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1 5 10 15 

Ala Phe Gly He Arg His Gly Ser Glu Arg Ser He Leu Val Gly Glu 

20 25 30 

His Ala Ala His Gin Gly Phe Val Val Ala Glu Val Asp Phe Leu His 

35 40 45 

Phe Ala Asn Leu Thr Ser Cys Cys Tyr Val 
50 55 



<210> 340 
<2U> 1426 
<212> PRT 
<213> E. Coli 

<400> 340 

Met Ser Gly Lys Pro Ala Ala Arg Gin Gly Asp Met Thr Gin Tyr Gly 

1 5 10 is 

Gly Pro He Val Gin Gly Ser Ala Gly Val Arg He Gly Ala Pro Thr 

20 25 30 

Gly Val Ala Cys Ser Val Cys Pro Gly Gly Met Thr Ser Gly Asn Pro 

35 40 45 

Val Asn Pro Leu Leu Gly Ala Lys Val Leu Pro Gly Glu Thr Asp Leu 

50 55 60 

Ala Leu Pro Gly Pro Leu Pro Phe He Leu Ser Arg Thr Tyr Ser Ser 
65 ™ 75 " 80 

Tyr Arg Thr Lys Thr Pro Ala Pro Val Gly Val Phe Gly Pro Gly Trp 

85 90 95 

Lys Ala Pro Ser Asp He Arg Leu Gin Leu Arg Asp Asp Gly Leu He 

100 105 no 

Leu Asn Asp Asn Gly Gly Arg Ser He His Phe Glu Pro Leu Leu Pro 

H5 120 125 

Gly Glu Ala Val Tyr Ser Arg Ser Glu Ser Met Trp Leu Val Arg Glv 

130 135 140 

Gly Lys Ala Ala Gin Pro Asp Gly His Thr Leu Ala Arg Leu Trp Glv 
145 150 155 160 

Ala Leu Pro Pro Asp He Arg Leu Ser Pro His Leu Tyr Leu Ala Thr 

165 170 175 

Asn Ser Ala Gin Gly Pro Trp Trp He Leu Gly Trp Ser Glu Arg Val 

180 185 190 

Pro Gly Ala Glu Asp Val Leu Pro Ala Pro Leu Pro Pro Tyr Arg Val 

195 200 205 

Leu Thr Gly Met Ala Asp Arg Phe Gly Arg Thr Leu Thr Tyr Arg Arg 

210 215 220 

Glu Ala Ala Gly Asp Leu Ala Gly Glu He Thr Gly Val Thr Asp Gly 
225 230 235 240 

Ala Gly Arg Glu Phe Arg Leu Val Leu Thr Thr Gin Ala Gin Arg Ala 

245 250 255 

Glu Glu Ala Arg Thr Ser Ser Leu Ser Ser Ser Asp Ser Ser Arg Pro 

260 265 270 

Leu Ser Ala Ser Ala Phe Pro Asp Thr Leu Pro Gly Thr Glu Tvr Glv 

275 280 285 

Pro Asp Arg Gly He Arg Leu Ser Ala Val Trp Leu Met His Asp Pro 

290 295 300 

Ala Tyr Pro Glu Ser Leu Pro Ala Ala Pro Leu Val Arg Tyr Thr Tvr 
305 310 315 320 

Thr Glu Ala Gly Glu Leu Leu Ala Val Tyr Asp Arg Ser Asn Thr Gin 

325 330 335 

Val Arg Ala Phe Thr Tyr Asp Ala Gin His Pro Gly Arg Met Val Ala 

340 345 350 

His Arg Tyr Ala Gly Arg Pro Glu Met Arg Tyr Arg Tyr Asp Asp Thr 
355 360 365 
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Gly Arg Val Val Glu Gin Leu Asn Pro Ala Gly Leu Ser Tyr Arg Tyr 

370 375 380 

Leu Tyr Glu Gin Asp Arg lie Thr Val Thr Asp Ser Leu Asn Arg Arg 
385 390 395 400 

Glu Val Leu His Thr Glu Gly Gly Ala Gly Leu Lys Arg Val Val Lys 

405 410 " 415 

Lys Glu Leu Ala Asp Gly Ser Val Thr Arg Ser Gly Tyr Asp Ala Ala 

420 425 430 

Gly Arg Leu Thr Ala Gin Thr Asp Ala Ala Gly Arg Arg Thr Glu Tyr 

435 440 445 

Gly Leu Asn Val Val Ser Gly Asp He Thr Asp He Thr Thr Pro Asp 

450 455 460 

Gly Arg Glu Thr Lys Phe Tyr Tyr Asn Asp Gly Asn Gin Leu Thr Ala 
465 470 475 480 

Val Val Ser Pro Asp Gly Leu Glu Ser Arg Arg Glu Tyr Asp Glu Pro 

485 490 " " 495 

Gly Arg Leu Val Ser Glu Thr Ser Arg Ser Gly Glu Thr Val Arg Tyr 

500 505 510 

Arg Tyr Asp Asp Ala His Ser Glu Leu Pro Ala Thr Thr Thr Asp Ala 

515 520 525 

Thr Gly Ser Thr Arg Gin Met Thr Trp Ser Arg Tyr Gly Gin Leu Leu 

530 535 540 

Ala Phe Thr Asp Cys Ser Gly Tyr Gin Thr Arg Tyr Glu Tyr Asp Arg 
545 550 555 560 

Phe Gly Gin Met Thr Ala Val His Arg Glu Glu Gly He Ser Leu Tyr 

565 570 575 

Arg Arg Tyr Asp Asn Arg Gly Arg Leu Thr Ser Val Lys Asp Ala Gin 

580 585 590 

Gly Arg Glu Thr Arg Tyr Giu Tyr Asn Ala Ala Gly Asp Leu Thr Ala 

595 600 605 

Val He Thr Pro Asp Gly Asn Arg Ser Glu Thr Gin Tyr Asp Ala Trp 

610 615 620 

Gly Lys Ala Val Ser Thr Thr Gin Gly Gly Leu Thr Arg Ser Met Glu 
625 630 635 640 

Tyr Asp Ala Ala Gly Arg Val He Ser Leu Thr Asn Glu Asn Gly Ser 

645 650 655 

His Ser Val Phe Ser Tyr Asp Ala Leu Asp Arg Leu Val Gin Gin Gly 

660 665 670 

Gly Phe Asp Gly Arg Thr Gin Arg Tyr His Tyr Asp Leu Thr Gly Lys 

675 680 685 

Leu Thr Gin Ser Glu Asp Glu Gly Leu Val lie Leu Trp Tyr Tyr Asp 

690 695 700 

Glu Ser Asp Arg He Thr His Arg Thr Val Asn Gly Glu Pro Ala Glu 
705 710 715 720 

Gin Trp Gin Tyr Asp Gly His Gly Trp Leu Thr Asp He Ser His Leu 

725 730 735 

Ser Glu Gly His Arg Val Ala Val His Tyr Gly Tyr Asp Asp Lys Gly 

740 745 750 

Arg Leu Thr Gly Glu Cys Gin Thr Val Glu Asn Pro Glu Thr Gly Glu 

755 760 765 

Leu Leu Trp Gin His Glu Thr Lys His Ala Tyr Asn Glu Gin Gly Leu 

770 775 780 

Ala Asn Arg Val Thr Pro Asp Ser Leu Pro Pro Val Glu Trp Leu Thr 
785 790 795 800 

Tyr Gly Ser Gly Tyr Leu Ala Gly Met Lys Leu Gly Gly Thr Pro Leu 

805 810 815 

Val Glu Tyr Thr Arg Asp Arg Leu His Arg Glu Thr Val Arg Ser Phe 

820 825 830 

Gly Ser Met Ala Gly Ser Asn Ala Ala Tyr Glu Leu Thr Ser Thr Tyr 

835 840 845 

Thr Pro Ala Gly Gin Leu Gin Ser Gin His Leu Asn Ser Leu Val Tyr 
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850 855 860 

Asp Arg Asp Tyr Gly Trp Ser Asp Asn Gly Asp Leu Val Arg lie Ser 
865 870 875 880 

Gly Pro Arg Gin Thr Arg Glu Tyr Gly Tyr Ser Ala Thr Gly Arg Leu 

885 890 895 

Glu Ser Val Arg Thr Leu Ala Pro Asp Leu Asp He Arg He Pro Tyr 

900 905 910 

Ala Thr Asp Pro Ala Gly Asn Arg Leu Pro Asp Pro Glu Leu His Pro 

915 920 " 925 

Asp Ser Thr Leu Thr Val Trp Pro Asp Asn Arg He Ala Glu Asp Ala 

930 935 940 

His Tyr Val Tyr Arg His Asp Glu Tyr Gly Arg Leu Thr Glu Lys Thr 
945 950 955 960 

Asp Arg He Pro Ala Gly Val He Arg Thr Asp Asp Glu Arg Thr His 

965 970 975 

His Tyr His Tyr Asp Ser Gin His Arg Leu Val Phe Tyr Thr Arg He 

980 985 990 

Gin His Gly Glu Pro Leu Val Glu Ser Arg Tyr Leu Tyr Asp Pro Leu 

995 1000 1005 

Gly Arg Arg Met Ala Lys Arg Val Trp Arg Arg Glu Arg Asp Leu Thr 

1010 1015 1020 

Gly Trp Met Ser Leu Ser Arg Lys Pro Glu Val Thr Trp Tyr Gly Trp 
1025 1030 1035 i 0 40 

Asp Gly Asp Arg Leu Thr Thr Val Gin Thr Asp Thr Thr Arg He Gin 

1045 1050 1055 

Thr Val Tyr Glu Pro Gly Ser Phe Thr Pro Leu He Arg Val Glu Thr 

1060 1065 1070 

Glu Asn Gly Glu Arg Glu Lys Ala Gin Arg Arg Ser Leu Ala Glu Thr 

1075 1080 1085 

Leu Gin Gin Glu Gly Ser Glu Asn Gly His Gly Val Val Phe Pro Ala 

1090 1095 HOO 

Glu Leu Val Arg Leu Leu Asp Arg Leu Glu Glu Glu He Arg Ala Asp 
1105 1110 ins " H20 

Arg Val Ser Ser Glu Ser Arg Ala Trp Leu Ala Gin Cys Gly Leu Thr 

1125 1130 " H35 

Val Glu Gin Leu Ala Arg Gin Val Glu Pro Glu Tyr Thr Pro Ala Arg 

1140 H45 ^ H50 

Lys Ala His Leu Tyr His Cys Asp His Arg Gly Leu Pro Leu Ala Leu 

1155 1160 H65 

He Ser Glu Asp Gly Asn Thr Ala Trp Ser Ala Glu Tyr Asp Glu TrD 

1170 1175 H80 

Gly Asn Gin Leu Asn Glu Glu Asn Pro His His Val Tyr Gin Pro Tyr 
1185 1190 H95 ^ 1200 

Arg Leu Pro Gly Gin Gin His Asp Glu Glu Ser Gly Leu Tyr Tyr Asn 

1205 1210 ' 1215 

Arg His Arg Tyr Tyr Asp Pro Leu Gin Gly Arg Tyr He Thr Gin Asp 

1220 1225 1230 

Pro Met Gly Leu Lys Gly Gly Trp Asn Leu Tyr Gin Tyr Pro Leu Asn 

1235 1240 1245 

Pro Leu Gin Gin He Asp Pro Met Gly Leu Leu Gin Thr Trp Asp Asp 

1250 1255 1260 

Ala Arg Ser Gly Ala Cys Thr Gly Gly Val Cys Gly Val Leu Ser Arg 
1265 1270 1275 1280 

He He Gly Pro Ser Lys Phe Asp Ser Thr Ala Asp Ala Ala Leu Asp 

1285 1290 1295 

Ala Leu Lys Glu Thr Gin Asn Arg Ser Leu Cys Asn Asp Met Glu Tyr 

1300 1305 1310 

Ser Gly He Val Cys Lys Asp Thr Asn Gly Lys Tyr Phe Ala Ser Lys 

1315 1320 1325 

Ala Glu Thr Asp Asn Leu Arg Lys Glu Ser Tyr Pro Leu Lys Arg Lys 
1330 1335 1340 
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Cys Pro Thr Gly Thr Asp Arg Val Ala Ala Tyr His Thr His Gly Ala 
1345 1350 1355 1360 

Asp Ser His Gly Asp Tyr Val Asp Glu Phe Phe Ser Ser Ser Asp Lys 

1365 1370 1375 

Asn Leu Val Arg Ser Lys Asp Asn Asn Leu Glu Ala Phe Tyr Leu Ala 

1380 1385 1390 

Thr Pro Asp Gly Arg Phe Glu Ala Leu Asn Asn Lys Gly Glu Tyr lie 

1395 1400 1405 

Phe lie Arg Asn Ser Val Pro Gly Leu Ser Ser Val Cys lie Pro Tyr 

1410 1415 1420 

His Asp 
1425 



<210> 341 
<211> 122 
<212> PRT 
<213> E. Coli 



<400> 


341 




















Met Lys 


Tyr 


Ser 


Ser lie Phe 


Ser Met 


Leu 


Ser 


Phe 


Phe 


He 


Leu 


Phe 


1 






5 




10 










15 




Ala Cys 


Asn 


Glu 


Thr Ala Val 


Tyr Gly 


Ser Asp 


Glu 


Asn 


He 


He 


Phe 






20 




25 










30 






Met Arg 


Tyr 


Val 


Glu Lys Leu 


His Leu 


Asp 


Lys 


Tyr 


Ser 


Val 


Lys 


Asn 




35 






40 








45 








Thr Val 


Lys 


Thr 


Glu Thr Met 


Ala He 


Gin 


Leu 


Ala 


Glu 


He 


Tyr 


Val 


50 






55 








60 








Arg Tyr 


Arg 


Tyr 


Gly Glu Arg 


He Ala 


Glu 


Glu 


Glu 


Lys 


Pro 


Tyr 


Leu 


65 






70 






75 






80 


lie Thr 


Glu 


Leu 


Pro Asp Ser 


Trp Val 


Val 


Glu Gly 


Ala 


Lys 


Leu 


Pro 








85 




90 










95 




Tyr Glu 


Val 


Ala 


Gly Gly Val 


Phe He 


He 


Glu 


He 


Asn 


Lys 


Lys Asn 






100 




105 










110 






Gly Cys 


Val 


Leu 


Asn Phe Leu 


His Ser 


Lys 















115 120 



<210> 342 
<211> 236 
<212> PRT 
<213> E. Coli 



<400> 


342 














Met Leu 


Ala 


Leu 


Met 


Asp Ala 


Asp Gly Asn He Ala 


Trp Ser 


Gly Glu 


1 






5 






10 




15 


Tyr Asp 


Glu 


Trp 


Gly Asn Gin 


Leu 


Asn Glu Glu Asn 


Pro His 


His Leu 






20 








25 


30 




His Gin 


Pro 


Tyr 


Arg 


Leu Pro 


Gly Gin Gin Tyr Asp 


Lys Glu 


Ser Gly 




35 








40 




45 


Leu Tyr 


Tyr Asn 


Arg 


Asn Arg 


Tyr 


Tyr Asp Pro Leu 


Gin Gly Arg Tyr 


50 








55 




60 






He Thr 


Gin 


Asp 


Pro 


He Gly 


Leu 


Glu Gly Gly Trp 


Ser Leu 


Tyr Ala 


65 








70 




75 




80 


Tyr Pro 


Leu 


Asn 


Pro 


Val Asn 


Gly 


lie Asp Pro Leu 


Gly Leu 


Ser Pro 








85 






90 




95 


Ala Asp 


Val 


Ala 


Leu 


He Arg 


Arg 


Lys Asp Gin Leu 


Asn His 


Gin Arg 






100 








105 


110 


Ala Trp 


Asp 


He 


Leu 


Ser Asp 


Thr 


Tyr Glu Asp Met 


Lys Arg 


Leu Asn 




115 








120 




125 




Leu Gly 


Gly Thr 


Asp 


Gin Phe 


Phe 


His Cys Met Ala 


Phe Cys Arg Val 
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130 










135 




Ser Lys 


Leu 


Asn 


Asp Ala 


Gly 


Val 


145 








150 






Tyr Glu 


Lys 


Glu 


He 
165 


Arg 


Asp 


Tyr 


Gly Arg 


Lys 


Val 
180 


Lys 


Leu 


Ser 


His 


Lys Asp 


Leu 
195 


Ala 


Val 


Asn 


Asp 


His 
200 


Asp Cys 


Ser 


Asp 


Arg 


Cys 


Ser 


Asp 


210 










215 




Thr He 


Lys 


Ala 


Leu 


Gin 


Asp 


Ala 


225 








230 







140 

Ser Arg Ser Ala Lys Gly Leu Gly 
155 160 
Gly Leu Asn Leu Phe Gly Met Tyr 

170 175 
Ser Glu Met He Glu Asp Asn Lys 
185 190 
Gly Leu Thr Cys Pro Ser Thr Thr 
205 

Tyr He Asn Pro Glu His Lys Lys 
220 

Gly Tyr Leu Lys 
235 



<210> 343 
<211> 86 
<212> PRT 
<213> E. Coli 

<400> 343 

Met Leu Ala He Ser Ser Asn Leu Ser Lys Met He He Phe He Phe 

1 5 10 15 

Ala He lie lie lie Val Val Leu Cys Val lie Thr Tyr Leu Tvr Leu 

20 25 30 

Tyr Lys Asp Glu Ser Leu Val Ser Lys His Tyr lie Asn Tyr Met Ala 

35 40 45 

lie Pro Glu Asn Asp Gly Val Phe Thr Trp Leu Pro Asp Phe Phe Pro 

50 55 60 

His Val Ala Val Asp lie Ser lie Tyr Thr Asn Val Glu Asp Asp Tyr 
65 70 75 80 

Phe Phe Leu lie Phe Pro 
85 



<210> 344 

<211> 63 

<212> PRT 

<213> E. Coli 

<400> 344 
Met Arg Ala Arg Glu Gin Val Ala 

1 5 
Asp Thr Lys Lys Val Trp Cys Lys 
20 

Asp Gly Val Asn Leu Phe Phe Val 

35 40 
Phe lie Thr Asn lie Gly Ala Gly 
50 55 



Lys 


lie Val 


Ser 


Lys 


Asn Asp Pro 




10 






15 


Tyr 


Gly Lys 


He 


Pro 


Gly Gin Gly 


25 








30 


Gly 


Glu lie 


Asn 


Val 


Thr His Tyr 








45 




Leu 


Pro Asp Ala 


Cys 


Ala Glu 



60 



<210> 345 
<211> 167 
<212> PRT 
<213> E. Coli 

<400> 345 

Met Pro Gly Asn Ser Pro His Tyr Gly Arg Trp Pro Gin His Asp Phe 
15 10 15 
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Thr Ser Leu Lys Lys Leu Arg Pro Gin Ser Val Thr Ser Arg He Gin 

20 25 30 

Pro Gly Ser Asp Val He Val Cys Ala Glu Met Asp Glu Gin Trp Gly 

35 40 45 

Tyr Val Gly Ala Lys Ser Arg Gin Arg Trp Leu Phe Tyr Ala Tyr Asp 

50 55 60 

Ser Leu Arg Lys Thr Val Val Ala His Val Phe Gly Glu Arg Thr Met 
65 70 75 80 

Ala Thr Leu Gly Arg Leu Met Ser Leu Leu Ser Pro Phe Asp Val Val 

85 90 95 

He Trp Met Thr Asp Gly Trp Pro Leu Tyr Glu Ser Arg Leu Lys Gly 

100 105 HO 

Lys Leu His Val He Ser Lys Arg Tyr Thr Gin Arg He Glu Arg His 

115 120 125 

Asn Leu Asn Leu Arg Gin His Leu Ala Arg Leu Gly Arg Lys Ser Leu 

130 135 140 

Ser Phe Ser Lys Ser Val Glu Leu His Asp Lys Val He Gly His Tyr 
145 150 155 ' 160 

Leu Asn He Lys His Tyr Gin 
165 



<210> 346 
<211> 91 
<212> PRT 
<213> E. Coli 



<400> 346 
Met Ala Ser Val Ser lie Ser Cys 

1 5 
Val Val Arg Asn Gly Lys Ser Thr 
20 

Ser His Cys Arg Lys Thr Trp Gin 

35 40 
Gin Pro Gly Thr His Gin Lys lie 

50 55 
Gly Cys Arg Ala Thr Ala Arg lie 
65 70 
Leu Arg His Leu Lys Asn Ser Gly 
85 



Pro Ser Cys Ser Ala Thr Asp Gly 

10 15 
Ala Gly His Gin Arg Tyr Leu Cys 
25 30 
Leu Gin Phe Thr Tyr Thr Ala Ser 
45 

He Asp Met Ala Met Asn Gly Val 
60 

Met Gly Val Gly Leu Asn Thr lie 

75 80 
Arg Ser Arg 
90 



<210> 347 
<211> 138 
<212> PRT 
<213> E . Coli 



<400> 347 
Met Met Thr Lys Thr Gin lie Asn 

1 5 
Leu Asp Tyr Pro Phe Glu Ala Pro 
20 

lie lie Gin lie Glu Phe Asn Ser 

35 40 
Leu Cys Asn Glu Val Ser lie Leu 

50 55 
Thr Phe Leu Arg Ala Met Asp Gly 
65 70 
Phe Ser Leu Ser lie Pro Glu Pro 
85 



Lys Leu He Lys Met Met Asn Asp 

10 15 
Leu Lys Glu Ser Phe lie Glu Ser 
25 30 
Asn Ser Thr Asn Cys Leu Glu Lys 
45 

Phe Lys Asn Gin Pro Asp Tyr Leu 
60 

Phe Glu Val Asn Gly Leu Arg Leu 

7 5 80 
Ser Val Lys Asn Leu Phe Ala Val 
90 95 
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Asn Glu Phe Tyr Arg Asn Asn Asp 
100 

Glu Arg Leu Val He Gly Asp Tyr 
115 120 
lie Lys Gly Asp Ala Ala Asn Leu 
130 135 



Asp Phe He Asn Pro Asp Leu Gin 
105 no 
Ser He Ser He Phe Thr Tyr Asp 
125 

Leu He 



<210> 348 
<211> 392 
<212> PRT 
<213> E. Coli 

<400> 348 

Met Ser Asn He Val Tyr Leu Thr Val Thr Gly Glu Gin Gin Gly Ser 

1 5 10 15 

He Ser Ala Gly Cys Gly Thr Ser Glu Ser Thr Gly Asn Arg Trp Gin 

20 25 30 

Ser Gly His Glu Asp Glu He Phe Thr Phe Ser Leu Leu Asn Asn He 

35 40 45 

Asn Asn Thr Gly Leu Gly Ser Gin Phe His Gly He Thr Phe Cys Lys 

50 55 60 

Leu He Asp Lys Ser Thr Pro Leu Phe He Asn Ser He Asn Asn Asn 
65 70 75 80 

Glu Gin Leu Phe Met Gly Phe Asp Phe Tyr Arg He Asn Arg Phe Gly 

85 90 95 

Arg Leu Glu Lys Tyr Tyr Tyr He Gin Leu Arg Gly Ala Phe Leu Ser 

100 105 HO 

Ala He His His Gin He He Glu Asn Gin Leu Asp Thr Glu Thr He 

115 120 125 

Thr He Ser Tyr Glu Phe He Leu Cys Gin His Leu He Ala Asn Thr 

130 135 140 

Glu Phe Ser Tyr Leu Ala Leu Pro Glu Asn Tyr Asn Arg Leu Phe Leu 
145 150 155 160 

Pro Asn Ser Lys Asn Gin Thr Asn Asn Arg Phe Lys Thr Leu Asn Ser 

165 170 175 

Lys Ala He Gly Arg Leu Leu Ala Ala Gly Gly Val Tyr Asn Gly Asn 

180 185 190 

lie Glu Gly Phe Arg Asp Thr Ala Glu Lys Leu Gly Gly Asp Ala He 

195 200 205 

Lys Gly Tyr Asp Gin He Leu Asn Glu Lys Thr Ala Gly lie Ala lie 

210 215 220 

Ala Thr Ala Ser lie Leu Leu Thr Lys Arg Ser Asn Val Asp Thr Tyr 
225 230 235 " 240 

Thr Glu He Asn Ser Tyr Leu Gly Lys Leu Arg Gly Gin Gin Lys Leu 

245 250 255 

Leu Asp Gly lie Asp lie lie Glu lie lie Tyr lie Lys Arg Pro Ser 

260 265 270 

Lys Asp Leu Ala Asn Leu Arg Lys Glu Phe Asn Lys Thr Val Arg Lys 

275 280 285 

Asn Phe Leu lie Lys Leu Ala Lys Thr Ser Glu Ala Ser Gly Arg Phe 

290 295 300 

Asn Ala Glu Asp Leu Leu Arg Met Arg Lys Gly Asn Val Pro Leu Asn 
305 310 315 320 

Tyr Asn Val His His Lys Leu Ser Leu Asp Asp Gly Gly Thr Asn Asp 

325 330 ' J 335 

Phe Glu Asn Leu Val Leu He Glu Asn Glu Pro Tyr His Lys Val Phe 

340 345 350 

Thr Asn Met Gin Ser Arg He Ala Lys Gly lie Leu Val Gly Glu Ser 

355 360 365 

Lys lie Thr Pro Trp Ala lie Pro Ser Gly Ser lie Tyr Pro Pro Met 



-168- 



WO 00/44906 





370 










375 






Lys 


Asn 


He 


Met 


Asp 


His 


Thr 


Lys 




385 










390 










<210> 


349 














<211> 


221 














<212> 


PRT 














<213> 


E. Coli 












<400> 


349 












Met 


Val 


Leu 


Ala 


Leu 


Asn 


Tyr 


Asn 


Met 


1 








5 










Glu 


Asn 


Ala 


Ala 


Lys 


Pro 


His 


Thr 


Met 








20 










25 


Tyr 


He 


Arg 


Ser 


He 


Glu 


Lys 


Asn 


Gly 






35 










40 




Gly 


Lys 


Leu Arg 


Tyr 


Ser 


Asp 


Glu 


Leu 




50 










55 






Thr 


Phe 


Leu Asp 


Ser Lys 


Arg 


Gin 


Leu 


65 










70 








Gly 


He 


Lys 


Thr 


Lys 


He 


He 


Asp 


Tyr 










85 










Ala 


Asn 


Thr 


Val 


Ala 


Phe 


Glu 


Asp 


Val 








100 










105 


Asp 


Phe 


He 


Leu 


Cys 


Ser 


Asn 


Val 


Leu 






115 










120 




Thr 


He 


Asp Lys 


He 


Val 


Leu 


Ser 


He 




130 










135 






Gly Glu 


Thr 


Leu 


He 


Val 


Asn 


Gin 


Tyr 


145 










150 








Tyr Glu 


Thr Gly 


Arg Lys 


His 


Leu 


Tyr 










165 










Lys 


Ser 


Val 


Ser 


Tyr Tyr 


Gly 


Leu 


Leu 








180 










185 


He 


Cys 


Ser 


Ser 


His 


Gly 


Leu 


Glu 


He 






195 










200 




Gly 


Ser 


Ser 


Tyr 


Val 


Thr 


Val 


Gly 


Ser 




210 










CL D 








<210> 


350 














<211> 


234 














<212> 


PRT 














<213> 


E. Coli 












<400> 


350 












Met 


Asn 


Asn 


Met 


Phe 


Glu 


Pro 


Pro 


Lys 


1 








5 










Lys 


Leu 


His 


Lys 


Ala 


Thr 


Phe 


Leu 


Asn 








20 










25 


Leu 


Val 


He 


Tyr 


Glu 


Tyr 


He 


Pro 


Leu 






35 










40 




Val 


Pro 


Pro 


He 


Lys 


Asp 


His 


Glu 


Ser 




50 










55 






Phe 


Gly 


He 


Leu 


Pro 


Cys 


Ala 


Phe 


Ala 


65 










70 








Gly Ala 


Leu Asp 


Leu 


His 


Asn 


Asn 


Ala 










85 










Tyr 


Leu 


Trp Asp 


Lys 


His 


Leu 


He 


He 
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380 



His 


Gly Val 


Asn 


He 


Arg Ser 


10 










15 


Pro 


Ser 


Arg 


Tyr 


Leu 


Cys Glu 










30 




His 


Ala 


Leu 


Asp 


Phe 


Gly Cys 








45 






He 


Ser 


Lys 


Phe 


Asp 


Glu Val 






60 








Glu 


Arg 


Glu 


Gin 


He 


He Arg 




75 








80 


Val 


Pro 


Arg 


Tyr 


Tyr 


Lys Asn 


90 










95 


Asp 


Lys 


He 


He 


Gly 


Gly Tyr 










110 




Ser 


Ala 


Val 


Pro 


Cys 


Arg Asp 








125 






Lys 


Arg 


Leu 


Leu 


Lys 


Ser Gly 






140 








Lys 


Ser 


Ser 


Tyr 


Phe 


Lys Lys 




155 








160 


Gly 


Tyr 


He 


Tyr 


Lys 


Asn Ser 


170 










175 


Asp 


Glu 


Leu 


Ala 


Val 


Gin Glu 










190 




Leu 


Lys 


Ser 


Trp 


Ser 


Lys Ala 








205 






Cys 


Asn 


Ala 


He 










220 








Asn 


Tyr Asn 


Glu 


Met 


Leu Pro 


10 










15 


Thr 


Leu 


He 


Tyr 


Cys 


He Leu 










30 




He 


Thr 


Leu 


Pro 


Thr 


Lys Tyr 








45 






Phe 


He 


Asn 


Trp 


Ala 


Leu Ser 






60 








He 


Phe 


Ala 


Tyr 


Leu 


He Ser 




75 








80 


Ala 


Lys 


Leu 


Leu 


Arg 


Val Arg 


90 










95 


Lys 


Pro 


Leu 


Ser 


Arg 


Arg Ala 
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100 










105 


Gly Val 


Asn Arg 


Lys 


Leu 


Asn 


Lys 


Asp 




115 










120 




Asn Leu 


Tyr 


Tyr 


Pro 


Glu 


Val 


Arg 


Lys 


130 










135 






Glu Leu 


Phe 


Trp 


Asn 


Lys 


Val 


Tyr 


Tyr 


145 








150 








Ser lie 


lie 


Ala 


Leu 


He 


Ser 


Phe 


Leu 








165 










Met Asp 


lie 


Phe 


His 


Val 


Glu 


Gly 


Ser 






180 










185 


Val lie 


Leu 


Ser 


Phe 


Ser 


Val 


Ser Gly 




195 










200 




Lys Pro 


Arg 


Thr 


Glu 


Ser 


Gin 


Val 


Gly 


210 










215 






Lys Glu 


Phe 


Phe 


Thr 


Lys 


Asn 


Asn 


He 


225 








230 
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110 

Glu Ala His Asn Val Met Ser 
125 

He Glu Asp Lys His Tyr He 
140 

Phe Trp He Phe Phe Glu Phe 
155 160 
He He Phe Phe Cys Lys Gin 
170 175 
Leu Leu Ser Leu Phe Phe Phe 
190 

He He Phe Ala Leu Thr Val 
205 

Lys lie Pro Asp Asp Lys He 
220 

Asn 



<210> 351 
<211> 94 
<212> PRT 
<213> E. Coli 



<400> 


351 








Met Phe 


Thr 


He 


Asn Ala 


Glu 


Val 


1 






5 






Ser Arg 


Arg 


Leu 


Arg Ala Ala 


Asn 






20 








Gly Lys 


Glu 


Ala 


Pro Leu 


Ala 


lie 




35 








40 


Asn Met 


Gin 


Ala 


Lys Ala 


Glu 


Phe 


50 








55 




Val Asp 


Gly 


Lys 


Glu lie 


Lys 


Val 


65 






70 






Pro Tyr 


Lys 


Pro 


Lys Leu 


Gin 


His 



85 



Arg Lys Glu Gin Gly Lys Gly Ala 

10 15 
Lys Phe Pro Ala lie lie Tyr Gly 
25 30 
Glu Leu Asp His Asp Lys Val Met 
45 

Tyr Ser Glu Val Leu Thr lie Val 
60 

Lys Ala Gin Asp Val Gin Arg His 

75 80 
He Asp Phe Val Arg Ala 
90 



<210> 352 
<211> 658 
<212> PRT 
<213> E . Coli 





<400> 


352 












Met 


Val 


Leu 


Phe 


Tyr Arg Ala 


His Trp Arg Asp Tyr 


Lys 


Asn 


Asp Gin 


1 








5 


10 






15 


Val 


Arg 


lie 


Met 


Met Asn Leu 


Thr Thr Leu Thr His 


Arg 


Asp Ala Leu 








20 




25 




30 




Cys 


Leu 


Asn 


Ala 


Arg Phe Thr 


Ser Arg Glu Glu Ala 


lie 


His 


Ala Leu 






35 






40 


45 






Thr 


Gin 


Arg 


Leu 


Ala Ala Leu 


Gly Lys lie Ser Ser 


Thr 


Glu 


Gin Phe 




50 






55 


60 








Leu 


Glu 


Glu 


Val 


Tyr Arg Arg 


Glu Ser Leu Gly Pro 


Thr 


Ala 


Leu Gly 


65 








70 


75 






80 


Glu 


Gly 


Leu 


Ala 


Val Pro His 


Gly Lys Thr Ala Ala 


Val 


Lys 


Glu Ala 










85 


90 






95 


Ala 


Phe 


Ala 


Val 


Ala Thr Leu 


Ser Glu Pro Leu Gin 


Trp 


Glu 


Gly Val 








100 




. 105 




110 


Asp 


Gly 


Pro 


Glu 


Ala Val Asp 


Leu Val Val Leu Leu 


Ala 


lie 


Pro Pro 
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115 120 125 

Asn Glu Ala Gly Thr Thr His Met Gin Leu Leu Thr Ala Leu Thr Thr 

130 135 i 4 o 

Arg Leu Ala Asp Asp Glu lie Arg Ala Arg He Gin Ser Ala Thr Thr 
145 150 155 160 

Pro Asp Glu Leu Leu Ser Ala Leu Asp Asp Lys Gly Gly Thr Gin Pro 

165 170 175 

Ser Ala Ser Phe Ser Asn Ala Pro Thr He Val Cys Val Thr Ala Cys 

180 185 190 

Pro Ala Gly He Ala His Thr Tyr Met Ala Ala Glu Tyr Leu Glu Lys 

195 200 205 

Ala Gly Arg Lys Leu Gly Val Asn Val Tyr Val Glu Lys Gin Gly Ala 

210 215 220 

Asn Gly He Glu Gly Arg Leu Thr Ala Asp Gin Leu Asn Ser Ala Thr 
22 5 230 235 2 40 

Ala Cys He Phe Ala Ala Glu Val Ala He Lys Glu Ser Glu Arg Phe 

245 250 255 

Asn Gly He Pro Ala Leu Ser Val Pro Val Ala Glu Pro He Arg His 

260 265 270 

Ala Glu Ala Leu He Gin Gin Ala Leu Thr Leu Lys Arg Ser Asp Glu 

275 280 285 

Thr Arg Thr Val Gin Gin Asp Thr Gin Pro Val Lys Ser Val Lys Thr 

290 295 300 

Glu Leu Lys Gin Ala Leu Leu Ser Gly He Ser Phe Ala Val Pro Leu 
305 310 315 320 

He Val Ala Gly Gly Thr Val Leu Ala Val Ala Val Leu Leu Ser Gin 

325 330 335 

He Phe Gly Leu Gin Asp Leu Phe Asn Glu Glu Asn Ser Trp Leu Trp 

340 345 350 

Met Tyr Arg Lys Leu Gly Gly Gly Leu Leu Gly He Leu Met Val Pro 

355 360 3 6 5 

Val Leu Ala Ala Tyr Thr Ala Tyr Ser Leu Ala Asp Lys Pro Ala Leu 

370 375 380 

Ala Pro Gly Phe Ala Ala Gly Leu Ala Ala Asn Met He Gly Ser Gly 
385 390 395 4 00 

Phe Leu Gly Ala Val Val Gly Gly Leu He Ala Gly Tyr Leu Met Arg 

405 410 415 

Trp Val Lys Asn His Leu Arg Leu Ser Ser Lys Phe Asn Gly Phe Leu 

420 425 430 

Thr Phe Tyr Leu Tyr Pro Val Leu Gly Thr Leu Gly Ala Gly Ser Leu 

435 440 445 

Met Leu Phe Val Val Gly Glu Pro Val Ala Trp He Asn Asn Ser Leu 

450 455 460 

Thr Ala Trp Leu Asn Gly Leu Ser Gly Ser Asn Ala Leu Leu Leu Gly 
4^5 470 475 480 

Ala He Leu Gly Phe Met Cys Ser Phe Asp Leu Gly Gly Pro Val Asn 

485 490 ' 495 

Lys Ala Ala Tyr Ala Phe Cys Leu Gly Ala Met Ala Asn Gly Val Tyr 

500 505 510 

Gly Pro Tyr Ala He Phe Ala Ser Val Lys Met Val Ser Ala Phe Thr 

515 520 525 

Val Thr Ala Ser Thr Met Leu Ala Pro Arg Leu Phe Lys Glu Phe Glu 

530 535 540 

He Glu Thr Gly Lys Ser Thr Trp Leu Leu Gly Leu Ala Gly He Thr 
545 550 555 560 

Glu Gly Ala He Pro Met Ala He Glu Asp Pro Leu Arg Val He Gly 

565 570 575 

Ser Phe Val Leu Gly Ser Met Val Thr Gly Ala He Val Gly Ala Met 

580 585 590 

Asn He Gly Leu Ser Thr Pro Gly Ala Gly He Phe Ser Leu Phe Leu 
595 600 605 
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Leu His Asp Asn Gly Ala Gly Gly Val Met Ala Ala He Gly Trp Phe 

610 615 620 

Gly Ala Ala Leu Val Gly Ala Ala He Ser Thr Ala He Leu Leu Met 
625 630 635 640 

Trp Arg Arg His Ala Val Lys His Gly Asn Tyr Leu Thr Asp Gly Val 
645 650 655 

Met Pro 



<210> 353 
<211> 877 
<212> PRT 
<213> E. Coli 

<400> 353 

Met Lys Ala Val Ser Arg Val His He Thr Pro His Met His Trp Asp 

1 5 10 15 

Arg Glu Trp Tyr Phe Thr Thr Glu Glu Ser Arg He Leu Leu Val Asn 

20 25 30 

Asn Met Glu Glu He Leu Cys Arg Leu Glu Gin Asp Asn Glu Tyr Lys 

35 40 45 

Tyr Tyr Val Leu Asp Gly Gin Thr Ala He Leu Glu Asp Tyr Phe Ala 

50 55 60 

Val Lys Pro Glu Asn Lys Asp Arg Val Lys Lys Gin Val Glu Ala Gly 
65 70 75 80 

Lys Leu He He Gly Pro Trp Tyr Thr Gin Thr Asp Thr Thr He Val 

85 90 ^ 95 

Ser Ala Glu Ser lie Val Arg Asn Leu Met Tyr Gly Met Arg Asp Cys 

100 105 no 

Leu Ala Phe Gly Glu Pro Met Lys He Gly Tyr Leu Pro Asp Ser Phe 

115 120 125 

Gly Met Ser Gly Gin Leu Pro His He Tyr Asn Gly Phe Gly He Thr 

130 135 140 

Arg Thr Met Phe Trp Arg Gly Cys Ser Glu Arg His Gly Thr Asp Lys 
145 150 155 160 

Thr Glu Phe Leu Trp Gin Ser Ser Asp Gly Ser Glu Val Thr Ala Gin 

165 170 175 

Val Leu Pro Leu Gly Tyr Ala He Gly Lys Tyr Leu Pro Ala Asp Glu 

180 185 190 

Asn Gly Leu Arg Lys Arg Leu Asp Ser Tyr Phe Asp Val Leu Glu Lys 

195 200 205 

Ala Ser Val Thr Lys Glu He Leu Leu Pro Asn Gly His Asp Gin Met 

210 215 220 

Pro Leu Gin Gin Asn lie Phe Glu Val Met Asp Lys Leu Arg Glu lie 
225 230 235 240 

Tyr Pro Gin Arg Lys Phe Val Met Ser Arg Phe Glu Glu Val Phe Glu 

245 250 255 

Lys He Glu Ala Gin Arg Asp Asn Leu Ala Thr Leu Lys Gly Glu Phe 

260 265 270 

He Asp Gly Lys Tyr Met Arg Val His Arg Thr lie Gly Ser Thr Arg 

275 280 285 

Met Asp lie Lys lie Ala His Ala Arg lie Glu Asn Lys lie Val Asn 

290 295 300 

Leu Leu Glu Pro Leu Ala Thr Leu Ala Trp Thr Leu Gly Phe Glu Tyr 
305 310 315 " 320 

His His Gly Leu Leu Glu Lys Met Trp Lys Glu lie Leu Lys Asn His 

325 330 335 

Ala His Asp Ser lie Gly Cys Cys Cys Ser Asp Lys Val His Arg Glu 

340 345 350 

lie Val Ala Arg Phe Glu Leu Ala Glu Asp Met Ala Asp Asn Leu lie 
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355 360 365 

Arg Phe Tyr Met Arg Lys He Ala Asp Asn Met Pro Gin Ser Asp Ala 

370 375 380 

Asp Lys Leu Val Leu Phe Asn Leu Met Pro Trp Pro Arg Glu Glu Val 
385 390 395 400 

He Asn Thr Thr Val Arg Leu Arg Ala Ser Gin Phe Asn Leu Arg Asp 

405 410 415 

Asp Arg Gly Gin Pro Val Pro Tyr Phe He Arg His Ala Arg Glu He 

420 425 430 

Asp Pro Gly Leu He Asp Arg Gin He Val His Tyr Gly Asn Tyr Asp 

435 440 445 

Pro Phe Met Glu Phe Asp He Gin He Asn Gin He Val Pro Ser Met 

450 455 460 

Gly Tyr Arg Thr Leu Tyr He Glu Ala Asn Gin Pro Gly Asn Val He 
4 ^5 470 475 ' 480 

Ala Ala Lys Ser Asp Ala Glu Gly He Leu Glu Asn Ala Phe Trp Gin 

485 490 495 

He Ala Leu Asn Glu Asp Gly Ser Leu Gin Leu Val Asp Lys Asp Ser 

500 505 510 

Gly Val Arg Tyr Asp Arg Val Leu Gin He Glu Glu Ser Ser Asp Asp 

515 520 525 

Gly Asp Glu Tyr Asp Tyr Ser Pro Ala Lys Glu Glu Trp Val He Thr 

530 535 540 

Ala Ala Asn Ala Lys Pro Gin Cys Asp He He His Glu Ala Trp Gin 
545 550 555 560 

Ser Arg Ala Val He Arg Tyr Asp Met Ala Val Pro Leu Asn Leu Ser 

565 570 575 

Glu Arg Ser Ala Arg Gin Ser Thr Gly Arg Val Gly Val Val Leu Val 

580 585 590 

Val Thr Leu Ser His Asn Ser Arg Arg He Asp Val Asp He Asn Leu 

595 600 605 

Asp Asn Gin Ala Asp Asp His Arg Leu Arg Val Leu Val Pro Thr Pro 

610 615 620 

Phe Asn Thr Asp Ser Val Leu Ala Asp Thr Gin Phe Gly Ser Leu Thr 
"5 630 635 640 

Arg Pro Val Asn Asp Ser Ala Met Asn Asn Trp Gin Gin Glu Gly Trp 

645 650 655 

Lys Glu Ala Pro Val Pro Val Trp Asn Met Leu Asn Tyr Val Ala Leu 

660 665 670 

Gin Glu Gly Arg Asn Gly Met Ala Val Phe Ser Glu Gly Leu Arg Glu 

675 680 685 

Phe Glu Val He Gly Glu Glu Lys Lys Thr Phe Ala He Thr Leu Leu 

690 695 700 

Arg Gly Val Gly Leu Leu Gly Lys Glu Asp Leu Leu Leu Arg Pro Gly 
705 710 715 720 

Arg Pro Ser Gly He Lys Met Pro Val Pro Asp Ser Gin Leu Arg Gly 

725 730 735 

Leu Leu Ser Cys Arg Leu Ser Leu Leu Ser Tyr Thr Gly Thr Pro Thr 

740 745 ' 750 

Ala Ala Gly Val Ala Gin Gin Ala Arg Ala Trp Leu Thr Pro Val Gin 

755 760 765 

Cys Tyr Asn Lys He Pro Trp Asp Val Met Lys Leu Asn Lys Ala Gly 

770 775 780 

Phe Asn Val Pro Glu Ser Tyr Ser Leu Leu Lys Met Pro Pro Val Gly 
785 790 795 800 

Cys Leu He Ser Ala Leu Lys Lys Ala Glu Asp Arg Gin Glu Val He 

805 810 815 

Leu Arg Leu Phe Asn Pro Ala Glu Ser Ala Thr Cys Asp Ala Thr Val 

820 825 830 

Ala Phe Ser Arg Glu Val He Ser Cys Ser Glu Thr Met Met Asp Glu 
835 840 845 
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His lie Thr Thr Glu Glu Asn Gin Gly Ser Asn Leu Ser Gly Pro Phe 

850 855 860 

Leu Pro Gly Gin Ser Arg Thr Phe Ser Tyr Arg Leu Ala 

865 870 875 



<210> 354 
<211> 523 
<212> PRT 
<213> E. Coli 



<400> 354 

Met Met Leu Asp He Val Glu Leu Ser Arg Leu Gin Phe Ala Leu Thr 

15 10 15 

Ala Met Tyr His Phe Leu Phe Val Pro Leu Thr Leu Gly Met Ala Phe 

20 25 30 

Leu Leu Ala He Met Glu Thr Val Tyr Val Leu Ser Gly Lys Gin He 

35 40 45 

Tyr Lys Asp Met Thr Lys Phe Trp Gly Lys Leu Phe Gly He Asn Phe 

50 55 60 

Ala Leu Gly Val Ala Thr Gly Leu Thr Met Glu Phe Gin Phe Gly Thr 
65 70 75 80 

Asn Trp Ser Tyr Tyr Ser His Tyr Val Gly Asp He Phe Gly Ala Pro 

85 90 95 

Leu Ala He Glu Gly Leu Met Ala Phe Phe Leu Glu Ser Thr Phe Val 

100 105 no 

Gly Leu Phe Phe Phe Gly Trp Asp Arg Leu Gly Lys Val Gin His Met 

115 120 125 

Cys Val Thr Trp Leu Val Ala Leu Gly Ser Asn Leu Ser Ala Leu Trp 

130 135 140 

He Leu Val Ala Asn Gly Trp Met Gin Asn Pro He Ala Ser Asp Phe 
145 150 155 160 

Asn Phe Glu Thr Met Arg Met Glu Met Val Ser Phe Ser Glu Leu Val 

165 170 175 

Leu Asn Pro Val Ala Gin Val Lys Phe Val His Thr Val Ala Ser Gly 

180 185 190 

Tyr Val Thr Gly Ala Met Phe lie Leu Gly He Ser Ala Trp Tyr Met 

195 200 205 

Leu Lys Gly Arg Asp Phe Ala Phe Ala Lys Arg Ser Phe Ala He Ala 

210 215 220 

Ala Ser Phe Gly Met Ala Ala Val Leu Ser Val He Val Leu Gly Asp 
225 230 235 240 

Glu Ser Gly Tyr Glu Met Gly Asp Val Gin Lys Thr Lys Leu Ala Ala 

245 250 255 

He Glu Ala Glu Trp Glu Thr Gin Pro Ala Pro Ala Ala Phe Thr Leu 

260 265 270 

Phe Gly He Pro Asp Gin Glu Glu Glu Thr Asn Lys Phe Ala He Gin 

275 280 285 

He Pro Tyr Ala Leu Gly He He Ala Thr Arg Ser Val Asp Thr Pro 

290 295 300 

Val He Gly Leu Lys Glu Leu Met Val Gin His Glu Glu Arg He Arg 
305 310 315 320 

Asn Gly Met Lys Ala Tyr Ser Leu Leu Glu Gin Leu Arg Ser Gly Ser 

325 330 335 

Thr Asp Gin Ala Val Arg Asp Gin Phe Asn Ser Met Lys Lys Asp Leu 

340 345 350 

Gly Tyr Gly Leu Leu Leu Lys Arg Tyr Thr Pro Asn Val Ala Asp Ala 

355 360 365 

Thr Glu Ala Gin He Gin Gin Ala Thr Lys Asp Ser He Pro Arg Val 

370 375 380 

Ala Pro Leu Tyr Phe Ala Phe Arg He Met Val Ala Cys Gly Phe Leu 
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385 








Leu Leu 


Ala 


He 


He 








405 


He Gly 


Glu 


Lys 


Lys 






420 




Leu Pro 


Trp 


He 


Ala 




435 






Arg Gin 


Pro Trp 


Ala 


450 








Ser Ser 


Leu 


Thr 


Ala 


4 65 








Gly Leu 


Tyr 


Thr 


Leu 








485 


Phe Ala 


Arg 


Leu 


Gly 






500 




Glu Gin 


Ser 


Ser 


Thr 




515 







390 395 
Ala Leu Ser Phe Trp Ser 
410 

Trp Leu Leu Arg Ala Ala 
425 

Val Glu Ala Gly Trp Phe 
440 

He Gly Glu Val Leu Pro 
455 

Gly Asp Leu He Phe Ser 
470 475 
Phe Leu Val Ala Glu Leu 
4 90 

Pro Ser Ser Leu Lys Thr 
505 

Thr Thr Gin Pro Ala Arg 
520 



400 

Val He Arg Asn Arg 
415 

Leu Tyr Gly He Pro 
430 

Val Ala Glu Tyr Gly 
445 

Thr Ala Val Ala Asn 
4 60 

Met Val Leu He Cys 
480 

Phe Leu Met Phe Lys 
4 95 

Gly Arg Tyr His Phe 
510 



<210> 355 
<211> 379 
<212> PRT 
<213> E. Coli 

<400> 355 

Met He Asp Tyr Glu Val Leu Arg Phe He Trp Trp Leu Leu Val Gly 

1 5 10 15 

Val Leu Leu He Gly Phe Ala Val Thr Asp Gly Phe Asp Met Gly Val 

20 25 30 

Gly Met Leu Thr Arg Phe Leu Gly Arg Asn Asp Thr Glu Arg Arg He 

35 40 45 

Met He Asn Ser He Ala Pro His Trp Asp Gly Asn Gin Val Trp Leu 

50 55 60 

He Thr Ala Gly Gly Ala Leu Phe Ala Ala Trp Pro Met Val Tyr Ala 
65 70 75 80 

Ala Ala Phe Ser Gly Phe Tyr Val Ala Met He Leu Val Leu Ala Ser 

85 90 95 

Leu Phe Phe Arg Pro Val Gly Phe Asp Tyr Arg Ser Lys He Glu Glu 

100 105 HO 

Thr Arg Trp Arg Asn Met Trp Asp Trp Gly He Phe He Gly Ser Phe 

115 120 125 

Val Pro Pro Leu Val He Gly Val Ala Phe Gly Asn Leu Leu Gin Gly 

130 135 140 

Val Pro Phe Asn Val Asp Glu Tyr Leu Arg Leu Tyr Tyr Thr Gly Asn 
145 150 155 ' 160 

Phe Phe Gin Leu Leu Asn Pro Phe Gly Leu Leu Ala Gly Val Val Ser 

165 170 " 175 

Val Gly Met He He Thr Gin Gly Ala Thr Tyr Leu Gin Met Arg Thr 

180 185 190 

Val Gly Glu Leu His Leu Arg Thr Arg Ala Thr Ala Gin Val Ala Ala 

195 200 205 

Leu Val Thr Leu Val Cys Phe Ala Leu Ala Gly Val Trp Val Met Tyr 

210 215 220 

Gly He Asp Gly Tyr Val Val Lys Ser Thr Met Asp His Tyr Ala Ala 
225 230 235 240 

Ser Asn Pro Leu Asn Lys Glu Val Val Arg Glu Ala Gly Ala Trp Leu 

245 250 ' 255 

Val Asn Phe Asn Asn Thr Pro He Leu Trp Ala He Pro Ala Leu Gly 

260 265 270 

Val Val Leu Pro Leu Leu Thr He Leu Thr Ala Arg Met Asp Lys Ala 
275 280 285 
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Ala 


Trp 


Ala 


Phe 


Val 


Phe 


Ser 


Ser 




290 










295 




Thr 


Ala 


Gly 


He 


Ala 


Met 


Phe 


Pro 


305 










310 






Met 


Asn 


Ala 


Ser 


Leu 


Thr 


Met 


Trp 










325 






Leu 


Asn 


Val 


Met 


Thr 


Trp 


Val 


Ala 








340 










Leu 


Tyr 


Thr 


Ala 


Trp 


Cys 


Tyr 


Trp 






355 










360 


Glu Asp 


He 


Glu 


Arg Asn 


Thr 


His 




370 










375 





Leu Thr Leu Ala Cys He He Leu 
300 

Phe Val Met Pro Ser Ser Thr Met 
315 320 
Asp Ala Thr Ser Ser Gin Leu Thr 

330 335 
Val Val Leu Val Pro He He Leu 
345 350 
Lys Met Phe Gly Arg He Thr Lys 
365 

Ser Leu Tyr 



<210> 356 
<211> 456 
<212> PRT 
<213> E. Coli 

<400> 356 

Met Glu Leu Ser Ser Leu Thr Ala Val Ser Pro Val Asp Gly Arg Tyr 

15 10 15 

Gly Asp Lys Val Ser Ala Leu Arg Gly He Phe Ser Glu Tyr Gly Leu 

20 25 30 

Leu Lys Phe Arg Val Gin Val Glu Val Arg Trp Leu Gin Lys Leu Ala 

35 40 45 

Ala His Ala Ala He Lys Glu Val Pro Ala Phe Ala Ala Asp Ala He 

50 55 60 

Gly Tyr Leu Asp Ala He Val Ala Ser Phe Ser Glu Glu Asp Ala Ala 
65 70 75 * 80 

Arg lie Lys Thr He Glu Arg Thr Thr Asn His Asp Val Lys Ala Val 

85 90 95 

Glu Tyr Phe Leu Lys Glu Lys Val Ala Glu He Pro Glu Leu His Ala 

100 105 HO 

Val Ser Glu Phe He His Phe Ala Cys Thr Ser Glu Asp He Asn Asn 

115 120 125 

Leu Ser His Ala Leu Met Leu Lys Thr Ala Arg Asp Glu Val He Leu 

130 135 140 

Pro Tyr Trp Arg Gin Leu He Asp Gly He Lys Asp Leu Ala Val Gin 
145 150 155 160 

Tyr Arg Asp He Pro Leu Leu Ser Arg Thr His Gly Gin Pro Ala Thr 

165 170 175 

Pro Ser Thr lie Gly Lys Glu Met Ala Asn Val Ala Tyr Arg Met Glu 

180 185 190 

Arg Gin Tyr Arg Gin Leu Asn Gin Val Glu He Leu Gly Lys He Asn 

195 200 205 

Gly Ala Val Gly Asn Tyr Asn Ala His He Ala Ala Tyr Pro Glu Val 

210 215 220 

Asp Trp His Gin Phe Ser Glu Glu Phe Val Thr Ser Leu Gly He Gin 
225 230 235 ' 240 

Trp Asn Pro Tyr Thr Thr Gin He Glu Pro His Asp Tyr lie Ala Glu 

245 250 - 255 

Leu Phe Asp Cys Val Ala Arg Phe Asn Thr He Leu lie Asp Phe Asp 

260 265 270 

Arg Asp Val Trp Gly Tyr lie Ala Leu Asn His Phe Lys Gin Lys Thr 

275 280 285 

He Ala Gly Glu lie Gly Ser Ser Thr Met Pro His Lys Val Asn Pro 

290 295 300 

lie Asp Phe Glu Asn Ser Glu Gly Asn Leu Gly Leu Ser Asn Ala Val 
305 310 315 320 

Leu Gin His Leu Ala Ser Lys Leu Pro Val Ser Arg Trp Gin Arg Asp 
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325 



Leu 


Thr 


Asp Ser 


Thr 


Val 


Leu 


Arg 






340 










Ala 


Leu 


He Ala 


Tyr 


Gin 


Ser 


Thr 






355 








360 


Val 


Asn 


Arg Asp 


His 


Leu 


Leu 


Asp 




370 








375 




Leu 


Ala 


Glu Pro 


He 


Gin 


Thr 


Val 


385 








390 






Pro 


Tyr 


Glu Lys 


Leu 


Lys 


Glu 


Leu 








405 








Glu Gly 


Met Lys 


Gin 


Phe 


He 


Asp 






420 










Lys 


Ala 


Arg Leu 


Lys 


Ala 


Met 


Thr 






435 








440 


lie 


Thr 


Met Val 


Asp 


Glu 


Leu 


Lys 




450 








455 



330 335 
Asn Leu Gly Val Gly He Gly Tyr 
345 350 
Leu Lys Gly Val Ser Lys Leu Glu 
365 

Glu Leu Asp His Asn Trp Glu Val 
380 

Met Arg Arg Tyr Gly He Glu Lys 
395 400 
Thr Arg Gly Lys Arg Val Asp Ala 

410 415 
Gly Leu Ala Leu Pro Glu Glu Glu 
425 430 
Pro Ala Asn Tyr He Gly Arg Ala 
445 



<210> 357 

<211> 61 

<212> PRT 

<213> E . Coli 

<400> 357 

Met Leu He Leu Thr Arg Arg Val Gly Glu Thr Leu Met lie Gly Asd 

1 5 10 15 

Glu Val Thr Val Thr Val Leu Gly Val Lys Gly Asn Gin Val Arg He 

20 25 30 

Gly Val Asn Ala Pro Lys Glu Val Ser Val His Arg Glu Glu lie Tyr 

35 40 45 

Gin Arg lie Gin Ala Glu Lys Ser Gin Gin Ser Ser Tyr 
50 55 60 



<210> 358 
<211> 83 
<212> RNA 
<213> E . Coli 

<400> 358 

ggugaggugg ccgagaggcu gaaggcgcuc cccugcuaag ggaguaugcg gucaaaagcu 60 
gcauccgggg uucgaauccc cgccucaccg cca 93 



<210> 359 
<211> 200 
<212> PRT 
<213> E. Coli 



<400> 359 



Meu 


Lys 


Asn 


Lys 


Ala 


Asp Asn 


Lys 


Lys 


Arg Asn 


Phe Leu 


Thr His Ser 


1 








5 








10 






15 


Glu 


lie 


Glu 


Ser 


Leu 


Leu Lys 


Ala 


Ala 


Asn 


Thr Gly Pro His Ala Ala 








20 








25 








30 


Arg 


Asn 


Tyr 


Cys 


Leu 


Thr Leu 


Leu 


Cys 


Phe 


lie 


His Gly 


Phe Arg Ala 






35 








40 








45 


Ser 


Glu 


lie 


Cys 


Arg 


Leu Arg 


lie 


Ser Asp 


lie 


Asp Leu 


Lys Ala Lys 




50 








55 










60 


Cys 


lie 


Tyr 


He 


His 


Arg Leu 


Lys 


Lys 


Gly 


Phe 


Ser Thr 


Thr His Pro 


65 










70 








75 




80 
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Leu Leu 


Asn 


Lys 


Glu 


Val Gin 


Ala 








85 






Thr Ser 


Tyr 


Pro 


His 


Ala Glu 


Ser 






100 








.Gly Asn 


Pro 


Leu 


Ser Arg Gin Gin 




115 








120 


L>xy biy 


Asn 


Ala 


Gly Leu Ser Leu 


130 








135 




his ber 


Cys Gly 


Phe 


Ala Leu 


Ala 


1 vl C 








150 




lie Gin 


Asp 


Tyr 


Leu 


Gly His 


Arg 








165 






Thr Ala 


Ser 


Asn 


Ala 


Gly Arg 


Phe 






180 








Gly Arg 


Gin 


Arg 


His 


Ala Val 


Leu 




195 








200 



Leu Lys Asn Trp Leu Ser He Arg 

90 95 
Glu Trp Val Phe Leu Ser Arg Lys 
105 no 
Phe Tyr His He He Ser Thr Ser 
125 

Glu He His Pro His Met Leu Arg 
140 

Asn Met Gly He Asp Thr Arg Leu 
155 i 6 o 
Asn He Arg His Thr Val Trp Tyr 

170 175 
Tyr Gly He Trp Asp Arg Ala Arg 
185 190 



<210> 360 
<211> 198 
<212> PRT 
<213> E. Coli 



<400> 360 

Met Ser Lys Arg Arg Tyr Leu Thr Gly Lys 

1 5 io 

Gin Ala Val Cys Tyr Gly Ala Thr Gly Ala 

20 25 
Leu Leu Ala Tyr Arg His Gly Met Arg He 

35 40 
His Tyr Gin Asp Leu Asp Leu Asn Glu Gly 

50 55 
Leu Lys Asn Gly Phe Ser Thr Val His Pro 
65 70 

Glu Ala Val Glu Arg Trp Thr Gin Glu Arg 

85 90 
Asp Arg Thr Asp Ala He Phe He Ser Arg 

100 105 
Arg Gin Gin Ala Tyr Arg He He Arg Asp 

115 120 
Thr Val Thr Gin Thr His Pro His Met Leu 

130 135 
Glu Leu Ala Glu Arg Gly Ala Asp Thr Arg 
145 150 

Gly His Arg Asn He Arg His Thr Val Arg 
165 170 
Ala Arg Phe Ala Gly Leu Trp Glu Arg Asn 

180 185 
Leu Lys Arg Glu Glu Val 
195 



Glu Val Gin Ala 
Arg Asp 
Ser Glu 



Arg He 
60 

Leu Arg 
75 

Ala Asn 

Arg Gly 

Ala Gly 

Arg His 
140 
Leu He 
155 

Tyr Thr 
Asn Leu 



Tyr Cys 

30 
Leu Leu 
45 

Asn He 



Phe Asp 

Trp Lys 

Ser Arg 
110 
He Glu 
125 

Ala Cys 

Gin Asp 

Ala Ser 

He Asn 
190 



Met Met 
15 

Leu He 

Asp Leu 

Arg Arg 

Glu Arg 

80 
Gly Ala 
95 

Leu Ser 

Ala Gly 

Gly Tyr 

Tyr Leu 
160 
Asn Ala 
175 

Glu Lys 



<210> 361 
<211> 182 
<212> PRT 
<213> E. Coli 



<400> 361 

Met Lys He Lys Thr Leu Ala lie Val Val Leu Ser Ala Leu Ser Leu 
15 io is 
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Ser 


Ser 


Thr 


Ala 
20 


Ala 


Val 


His 


Phe 
35 


Lys 


Gly 


Gly Ser 


Val 


Asp 


Gin 




50 








Leu 


Ala 


Gin 


Glu 


Gly 


65 










Leu 


Asn 


Asp 


Cys 


Asp 
85 


Leu 


Gly 


Thr 


Ala 
100 


He 


Ser 


Ser 


Ala 
115 


Ala 


Gly 


Arg Thr 


Gly Ala 


Ala 




130 








Thr 


Thr 


Leu 


Asn 


Asn 


145 










Phe 


Ala 


Thr Gly 


Ala 










165 


Phe 


Lys 


Val 


Gin 
180 


Tyr 



Leu Ala Ala Ala Thr Thr 
25 

Glu Val Val Asn Ala Ala 
40 

Thr Val Gin Leu Gly Gin 
55 

Ala Thr Ser Ser Ala Val 
70 75 
Thr Asn Val Ala Ser Lys 
90 

Asp Ala Gly His Thr Asn 
105 

Ser Ala Thr Asn Val Gly 
120 

Leu Thr Leu Asp Gly Ala 
135 

Gly Thr Asn Thr He Pro 
150 155 
Ala Thr Pro Gly Ala Ala 
170 

Gin 



Val Asn Gly Gly Thr 
30 

Cys Ala Val Asp Ala 
45 

Val Arg Thr Ala Ser 
60 

Gly Phe Asn He Gin 
80 

Ala Ala Val Ala Phe 
95 

Val Leu Ala Leu Gin 
110 

Val Gin He Leu Asp 
125 

Thr Phe Ser Ser Glu 
140 

Phe Gin Ala Arg Tyr 
160 

Asn Ala Asp Ala Thr 
175 



<210> 362 
<211> 215 
<212> PRT 
<213> E . Coli 

<400> 362 

Met Leu Leu Met Arg Met Arg Pro Ser Arg Phe Ser He Asn Asn Leu 

15 10 15 

Pro Arg Phe Arg Asp Val He Thr Gly Arg Asp Ala His Pro Cys Ala 

20 25 30 

He Lys He Thr Met Lys Arg Lys Arg Leu Phe Leu Leu Ala Ser Leu 

35 40 45 

Leu Pro Met Phe Ala Leu Ala Gly Asn Lys Trp Asn Thr Thr Leu Pro 

50 55 60 

Gly Gly Asn Met Gin Phe Gin Gly Val He He Ala Glu Thr Cys Arg 
65 70 75 80 

He Glu Ala Gly Asp Lys Gin Met Thr Val Asn Met Gly Gin He Ser 

85 90 95 

Ser Asn Arg Phe His Ala Val Gly Glu Asp Ser Ala Pro Val Pro Phe 

100 105 no 

Val He His Leu Arg Glu Cys Ser Thr Val Val Ser Glu Arg Val Gly 

115 120 125 

Val Ala Phe His Gly Val Ala Asp Gly Lys Asn Pro Asp Val Leu Ser 

130 135 140 

Val Gly Glu Gly Pro Gly He Ala Thr Asn He Gly Val Ala Leu Phe 
145 150 155 160 

Asp Asp Glu Gly Asn Leu Val Pro He Asn Arg Pro Pro Ala Asn Trp 

165 170 " 175 

Lys Arg Leu Tyr Ser Gly Ser Thr Ser Leu His Phe He Ala Lys Tyr 

180 185 190 

Arg Ala Thr Gly Arg Arg Val Thr Gly Gly He Ala Asn Ala Gin Ala 

195 200 205 

Trp Phe Ser Leu Thr Tyr Gin 
210 215 
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<210> 363 
<211> 241 
<212> PRT 
<213> E. Coli 

<400> 363 

Met Ser Asn Lys Asn Val Asn Val Arg Lys Ser Gin Glu He Thr Phe 

1 5 io 15 

Cys Leu Leu Ala Gly lie Leu Met Phe Met Ala Met Met Val Ala Glv 

20 25 30 

Arg Ala Glu Ala Gly Val Ala Leu Gly Ala Thr Arg Val He Tyr Pro 

35 40 45 

Ala Gly Gin Lys Gin Glu Gin Leu Ala Val Thr Asn Asn Asp Glu Asn 

50 55 60 

Ser Thr Tyr Leu He Gin Ser Trp Val Glu Asn Ala Asp Gly Val Lys 
65 ™ 75 1 80 

Asp Gly Arg Phe He Val Thr Pro Pro Leu Phe Ala Met Lys Glv Lvs 

85 90 95 

Lys Glu Asn Thr Leu Arg He Leu Asp Ala Thr Asn Asn Gin Leu Pro 

100 105 no 

Gin Asp Arg Glu Ser Leu Phe Trp Met Asn Val Lys Ala He Pro Ser 

115 120 125 

Met Asp Lys Ser Lys Leu Thr Glu Asn Thr Leu Gin Leu Ala He He 

130 135 140 

Ser Arg He Lys Leu Tyr Tyr Arg Pro Ala Lys Leu Ala Leu Pro Pro 
145 150 155 160 

Asp Gin Ala Ala. Glu Lys Leu Arg Phe Arg Arg Ser Ala Asn Ser Leu 

165 170 175 

Thr Leu He Asn Pro Thr Pro Tyr Tyr Leu Thr Val Thr Glu Leu Asn 

180 185 190 

Ala Gly Thr Arg Val Leu Glu Asn Ala Leu Val Pro Pro Met Gly Glu 

195 200 205 

Ser Thr Val Lys Leu Pro Ser Asp Ala Gly Ser Asn He Thr Tyr Arg 

210 215 220 

Thr He Asn Asp Tyr Gly Ala Leu Thr Pro Lys Met Thr Gly Val Met 
225 230 235 J 240 

Glu 



<210> 364 
<211> 878 
<212> PRT 
<213> E. Coli 

<400> 364 

Met Ser Tyr Leu Asn Leu Arg Leu Tyr Gin Arg Asn Thr Gin Cys Leu 

1 5 io 15 

His He Arg Lys His Arg Leu Ala Gly Phe Phe Val Arg Leu Val Val 

20 25 30 

Ala Cys Ala Phe Ala Ala Gin Ala Pro Leu Ser Ser Ala Asp Leu Tyr 

35 40 45 

Phe Asn Pro Arg Phe Leu Ala Asp Asp Pro Gin Ala Val Ala Asp Leu 

50 55 60 

Ser Arg Phe Glu Asn Gly Gin Glu Leu Pro Pro Gly Thr Tyr Arg Val 
65 70 75 80 

Asp He Tyr Leu Asn Asn Gly Tyr Met Ala Thr Arg Asp Val Thr Phe 

85 90 95 

Asn Thr Gly Asp Ser Glu Gin Gly He Val Pro Cys Leu Thr Arg Ala 

100 105 no 

Gin Leu Ala Ser Met Gly Leu Asn Thr Ala Ser Val Ala Gly Met Asn 
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115 120 125 

Leu Leu Ala Asp Asp Ala Cys Val Pro Leu Thr Thr Met Val Gin Asp 

130 135 140 

Ala Thr Ala His Leu Asp Val Gly Gin Gin Arg Leu Asn Leu Thr lie 
145 150 155 160 

Pro Gin Ala Phe Met Ser Asn Arg Ala Arg Gly Tyr He Pro Pro Glu 

165 170 " 175 

Leu Trp Asp Pro Gly He Asn Ala Gly Leu Leu Asn Tyr Asn Phe Ser 

180 185 190 

Gly Asn Ser Val Gin Asn Arg He Gly Gly Asn Ser His Tyr Ala Tyr 

195 200 205 

Leu Asn Leu Gin Ser Gly Leu Asn He Gly Ala Trp Arg Leu Arg Asp 

210 215 220 

Asn Thr Thr Trp Ser Tyr Asn Ser Ser Asp Arg Ser Ser Gly Ser Lys 
225 230 235 240 

Asn Lys Trp Gin His He Asn Thr Trp Leu Glu Arg Asp He He Pro 

245 250 255 

Leu Arg Ser Arg Leu Thr Leu Gly Asp Gly Tyr Thr Gin Gly Asp lie 

260 265 ' 270 

Phe Asp Gly He Asn Phe Arg Gly Ala Gin Leu Ala Ser Asp Asp Asn 

275 280 285 

Met Leu Pro Asp Ser Gin Arg Gly Phe Ala Pro Val lie His Gly lie 

290 295 300 

Ala Arg Gly Thr Ala Gin Val Thr lie Lys Gin Asn Gly Tyr Asp lie 
305 310 315 " * 320 

Tyr Asn Ser Thr Val Pro Pro Gly Pro Phe Thr He Asn Asp He Tyr 

325 330 335 

Ala Ala Gly Asn Ser Gly Asp Leu Gin Val Thr lie Lys Glu Ala Asp 

340 345 350 

Gly Ser Thr Gin He Phe Thr Val Pro Tyr Ser Ser Val Pro Leu Leu 

355 360 365 

Gin Arg Glu Gly His Thr Arg Tyr Ser He Thr Ala Gly Glu Tyr Arg 

370 375 380 

Ser Gly Asn Ala Gin Gin Glu Lys Thr Arg Phe Phe Gin Ser Thr Leu 
385 390 395 400 

Leu His Gly Leu Pro Ala Gly Trp Thr lie Tyr Gly Gly Thr Gin Leu 

405 410 ' 415 

Ala Asp Arg Tyr Arg Ala Phe Asn Phe Gly lie Gly Lys Asn Met Gly 

420 425 " 430 

Ala Leu Gly Ala Leu Ser Val Asp Met Thr Gin Ala Asn Ser Thr Leu 

435 440 445 

Pro Asp Asp Ser Gin His Asp Gly Gin Ser Val Arg Phe Leu Tyr Asn 

450 455 460 

Lys Ser Leu Asn Glu Ser Gly Thr Asn lie Gin Leu Val Gly Tyr Arg 
465 470 475 480 

Tyr Ser Thr Ser Gly Tyr Phe Asn Phe Ala Asp Thr Thr Tyr Ser Arg 

485 490 ' 495 

Met Asn Gly Tyr Asn lie Glu Thr Gin Asp Gly Val lie Gin Val Lys 

500 505 ' 510 

Pro Lys Phe Thr Asp Tyr Tyr Asn Leu Ala Tyr Asn Lys Arg Gly Lys 

515 520 525 

Leu Gin Leu Thr Val Thr Gin Gin Leu Gly Arg Thr Ser Thr Leu Tyr 

530 535 540 

Leu Ser Gly Ser His Gin Thr Tyr Trp Gly Thr Ser Asn Val Asp Glu 
545 550 555 560 

Gin Phe Gin Ala Gly Leu Asn Thr Ala Phe Glu Asp lie Asn Trp Thr 

565 570 575 

Leu Ser Tyr Ser Leu Thr Lys Asn Ala Trp Gin Lys Gly Arg Asp Gin 

580 585 590 

Met Leu Ala Leu Asn Val Asn lie Pro Phe Ser His Trp Leu Arg Ser 
595 600 605 
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Asp Ser Lys Ser Gin Trp Arg His Ala Ser Ala Ser Tyr Ser Met Ser 

610 615 620 

His Asp Leu Asn Gly Arg Met Thr Asn Leu Ala Gly Val Tyr Gly Thr 
625 630 635 640 

Leu Leu Glu Asp Asn Asn Leu Ser Tyr Ser Val Gin Thr Gly Tyr Ala 

645 650 655 

Gly Gly Gly Asp Gly Asn Ser Gly Ser Thr Gly Tyr Ala Thr Leu Asn 

660 665 670 

Tyr Arg Gly Gly Tyr Gly Asn Ala Asn He Gly Tyr Ser His Ser Asp 

675 680 685 

Asp lie Lys Gin Leu Tyr Tyr Gly Val Ser Gly Gly Val Leu Ala His 

690 695 700 

Ala Asn Gly Val Thr Leu Gly Gin Pro Leu Asn Asp Thr Val Val Leu 
705 710 715 720 

Val Lys Ala Pro Gly Ala Lys Asp Ala Lys Val Glu Asn Gin Thr Gly 

725 730 735 

Val Arg Thr Asp Trp Arg Gly Tyr Ala Val Leu Pro Tyr Ala Thr Glu 

740 745 750 

Tyr Arg Glu Asn Arg Val Ala Leu Asp Thr Asn Thr Leu Ala Asp Asn 

755 760 765 

Val Asp Leu Asp Asn Ala Val Ala Asn Val Val Pro Thr Arg Gly Ala 

770 775 780 

He Val Arg Ala Glu Phe Lys Ala Arg Val Gly He Lys Leu Leu Met 
785 790 795 800 

Thr Leu Thr His Asn Asn Lys Pro Leu Pro Phe Gly Ala Met Val Thr 

805 810 " 815 

Ser Glu Ser Ser Gin Ser Ser Gly He Val Ala Asp Asn Gly Gin Val 

820 825 830 

Tyr Leu Ser Gly Met Pro Leu Ala Gly Lys Val Gin Val Lys Trp Gly 

835 840 845 

Glu Glu Glu Asn Ala His Cys Val Ala Asn Tyr Gin Leu Pro Pro Glu 

850 855 ~ 860 

Ser Gin Gin Gin Leu Leu Thr Gin Leu Ser Ala Glu Cys Arg 
865 870 875 



<210> 365 
<211> 176 
<212> PRT 
<213> E . Coli 





<400> 


365 
























Met Arg 


Asn 


Lys 


Pro 


Phe 


Tyr 


Leu 


Leu 


Cys 


Ala 


Phe 


Leu Trp Leu Ala 










5 










10 










15 


Val 


Ser 


His 


Ala 


Leu 


Ala 


Ala 


Asp 


Ser 


Thr 


He 


Thr 


He Arg Gly Tyr 








20 










25 










30 




Val Arg 


Asp Asn 


Gly Cys 


Ser 


Val 


Ala 


Ala 


Glu 


Ser 


Thr 


Asn 


Phe Thr 






35 










40 










45 






Val 


Asp 


Leu 


Met 


Glu 


Asn 


Ala 


Ala 


Lys 


Gin 


Phe 


Asn 


Asn 


He 


Gly Ala 




50 










55 










60 






Thr 


Thr 


Pro 


Val 


Val 


Pro 


Phe 


Arg 


He 


Leu 


Leu 


Ser 


Pro 


Cys 


Gly Asn 


65 










70 










75 






80 


Ala 


Val 


Ser 


Ala 


Val 


Lys 


Val 


Gly 


Phe 


Thr Gly Val 


Ala 


Asp 


Ser His 










85 










90 










95 


Asn 


Ala 


Asn 


Leu 


Leu 


Ala 


Leu 


Glu 


Asn 


Thr 


Val 


Ser 


Ala 


Ala 


Ser Gly 








100 










105 










110 


Leu 


Gly 


He 
115 


Gin 


Leu 


Leu 


Asn 


Glu 
120 


Gin 


Gin 


Asn 


Gin 


He 
125 


Pro 


Leu Asn 


Ala 


Pro 


Ser 


Ser 


Ala 


Leu 


Ser 


Trp 


Thr 


Thr 


Leu 


Thr 


Pro 


Gly Lys Pro 




130 










135 










140 








Asn 


Thr 


Leu 


Asn 


Phe 


Tyr Ala Arg 


Leu 


Met 


Ala 


Thr 


Gin 


Val 


Pro Val 
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145 150 155 160 

Thr Ala Gly His He Asn Ala Thr Ala Thr Phe Thr Leu Glu Tyr Gin 
165 170 175 



<210> 366 
<211> 167 
<212> PRT 
<213> E. Coli 

<400> 366 

Met Lys Trp Cys Lys Arg Gly Tyr Val Leu Ala Ala He Leu Ala Leu 

1 5 10 15 

Ala Ser Ala Thr He Gin Ala Ala Asp Val Thr He Thr Val Asn Gly 

20 25 30 

Lys Val Val Ala Lys Pro Cys Thr Val Ser Thr Thr Asn Ala Thr Val 

35 40 45 

Asp Leu Gly Asp Leu Tyr Ser Phe Ser Leu Met Ser Ala Gly Ala Ala 

50 55 60 

Ser Ala Trp His Asp Val Ala Leu Glu Leu Thr Asn Cys Pro Val Gly 
65 70 75 80 

Thr Ser Arg Val Thr Ala Ser Phe Ser Gly Ala Ala Asp Ser Thr Gly 

85 90 " 95 

Tyr Tyr Lys Asn Gin Gly Thr Ala Gin Asn He Gin Leu Glu Leu Gin 

100 105 no 

Asp Asp Ser Gly Asn Thr Leu Asn Thr Gly Ala Thr Lys Thr Val Gin 

115 120 125 

Val Asp Asp Ser Ser Gin Ser Ala His Phe Pro Leu Gin Val Arg Ala 

130 135 140 

Leu Thr Val Asn Gly Gly Ala Thr Gin Gly Thr He Gin Ala Val He 
145 150 155 160 

Ser He Thr Tyr Thr Tyr Ser 
165 



<210> 367 
<211> 300 
<212> PRT 
<213> E. Coli 





<400> 


367 


















Met 


Lys 


Arg 


Val 


He 


Thr 


Leu Phe 


Ala Val 


Leu 


Leu 


Met Gly Trp 


Ser 


1 








5 






10 






15 




Val 


Asn 


Ala 


Trp 


Ser 


Phe 


Ala Cys 


Lys Thr 


Ala 


Asn 


Gly Thr Ala 


He 








20 








25 






30 




Pro 


He 


Gly 


Gly 


Gly 


Ser 


Ala Asn 


Val Tyr 


Val 


Asn 


Leu Ala Pro 


Val 






35 








40 








45 




Val 


Asn 


Val 


Gly 


Gin 


Asn 


Leu Val 


Val Asp 


Leu 


Ser 


Thr Gin He 


Phe 




50 










55 






60 


Cys 


His 


Asn Asp 


Tyr 


Pro 


Glu Thr 


He Thr 


Asp 


Tyr 


Val Thr Leu 


Gin 


65 










70 






75 




80 


Arg Gly 


Ser 


Ala 


Tyr Gly Gly Val 


Leu Ser 


Asn 


Phe 


Ser Gly Thr 


Val 










85 






90 






95 




Lys 


Tyr 


Ser 


Gly 


Ser 


Ser 


Tyr Pro 


Phe Pro 


Thr 


Thr 


Ser Glu Thr 


Pro 








100 








105 






110 




Arg 


Val 


Val 


Tyr 


Asn 


Ser 


Arg Thr 


Asp Lys 


Pro 


Trp 


Pro Val Ala 


Leu 






115 








120 






125 




Tyr 


Leu 


Thr 


Pro 


Val 


Ser 


Ser Ala Gly Gly Val Ala 


lie Lys Ala 


Gly 




130 










135 






140 
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Ser Leu He Ala Val 
145 

Asp Asp Phe Gin Phe 
165 

.Val Pro Thr Gly Gly 
180 

Leu Pro Asp Tyr Pro 
195 

Ala Lys Ser Gin Asn 
210 

Ala Gly Asn Ser He 
225 

Gly Val Gly Val Gin 
245 

Asn Thr Val Ser Leu 
260 

Leu Thr Ala Asn Tyr 
275 

Val Gin Ser . He He 
290 



Leu He Leu Arg Gin Thr 
150 155 
Val Trp Asn He Tyr Ala 
170 

Cys Asp Val Ser Ala Arg 
185 

Gly Ser Val Pro He Pro 
200 

Leu Gly Tyr Tyr Leu Ser 
215 

Phe Thr Asn Thr Ala Ser 
230 235 
Leu Thr Arg Asn Gly Thr 
250 

Gly Ala Val Gly Thr Ser 
265 

Ala Arg Thr Gly Gly Gin 
280 

Gly Val Thr Phe Val Tyr 
295 



Asn Asn Tyr Asn Ser 
160 

Asn Asn Asp Val Val 
175 

Asp Val Thr Val Thr 
190 

Leu Thr Val Tyr Cys 
205 

Gly Thr Thr Ala Asp 
220 

Phe Ser Pro Ala Gin 
240 

He He Pro Ala Asn 
255 

Ala Val Ser Leu Gly 
270 

Val Thr Ala Gly Asn 
285 

Gin 
300 



<210> 368 
<211> 521 
<212> PRT 
<213> E. Coli 



<400> 


368 














Met Leu 


Ser 


Lys 


Leu 


Pro Arg Arg 


Leu Arg Ser Phe Gin Thr Tyr Cys 


1 






5 








10 


15 


Thr lie 


Arg Val 


His Arg Gly Glu Asp Met Lys Ser Met 


Asp Lys Leu 






20 










25 


30 


Thr Thr 


Gly Val 


Ala 


Tyr 


Gly 


Thr 


Ser Ala Gly Asn Ala 


Gly Phe Trp 




35 










40 


45 


Ala Leu 


Gin 


Leu 


Leu Asp 


Lys 


Val 


Thr Pro Ser Gin Trp Ala Ala lie 


50 










55 




60 




Gly Val 


Leu 


Gly 


Ser 


Leu 


Val 


Phe Gly Leu Leu Thr Tyr Leu Thr Asn 


65 








70 






75 


80 


Leu Tyr 


Phe 


Lys 


He 


Lys Glu Asp Arg Arg Lys Ala Ala 


Arg Gly Glu 








85 








90 


95 


Ser Asn 


Asp 


Ser 


Arg 


Leu 


Thr Gly Cys Glu Arg Ser Pro 


Phe Glu Ser 






100 










105 


110 


Tyr Gly 


Asn 


Cys 


Ser 


Leu 


Thr 


Gly Gin Arg Thr Leu Arg Asn Phe Pro 




115 










120 


125 




Gly Cys 


Arg 


His 


Gly 


Pro 


Cys 


Arg 


Ser Cys Ala Gly Val 


Leu Gly Ser 


130 










135 




140 




Ser Gin 


Lys 


Glu 


Arg 


Pro 


Ala 


Ser 


Leu Pro Gly Ser Ser 


Arg Lys lie 


145 








150 






155 


160 


Val Arg 


Lys 


Ser 


Val 


Leu 


Ser 


Ala 


Ala Ser Val Leu Leu 


Asp Lys Ser 








165 








170 


175 


Cys Gin 


Ala 


Arg 


Ala 


Ser 


Ser 


Ser 


lie Ser Met Asn Thr 


Lys lie Arg 






180 










185 


190 


Tyr Gly 


Leu 


Ser 


Ala 


Ala 


Val 


Leu 


Ala Leu He Gly Ala 


Gly Ala Ser 




195 










200 


205 




Ala Pro 


Gin 


He 


Leu Asp Gin 


Phe 


Leu Asp Glu Lys Glu 


Gly Asn His 


210 










215 




220 




Thr Met 


Ala 


Tyr 


Arg Asp Gly Ser Gly He Trp Thr lie 


Cys Arg Gly 


225 








230 






235 


240 


Ala Thr 


Val 


Val 


Asp Gly 


Lys 


Thr 


Val Phe Pro Asn Met 


Lys Leu Ser 



245 250 255 



-184- 



WO 00/44906 



Lvs 


Glu 


Lvs 


Cvs 


Asp Gin 


Val 


Asn 


Ala 








260 










265 


Ala 


Trp 


Val 
275 


Glu 


Arg 


Asn 


He 


Lys 
280 


Val 


Ala 


Gly 
290 


lie 


Ala 


Ser 


Phe 


Cys 
295 


Pro 


Tyr 


Phe 


Pro 


Ser 


Thr 


Phe 


Tyr 


Lys 


Arg 


Leu 


305 










310 








Ala 


Cvs 


Glu 


Ala 


He 
325 


Arg 


Trp 


Trp 


He 


Arq 


lie 


Arg 


Ser 


Asn 


Asn 


Cys 


Tyr Gly 








340 










345 


Glu 


Ser 


Ala 


Leu 


Thr Cys 


Trp 


Gly 


He 






355 










360 




Phe 


Phe 


Ser Cys 


Cys 


Gin 


Asp 


Leu 


Ser 




370 










375 






Glu 


Asp 


Gly Lys 


Lys Asn Gly 


Arg Asn 


385 










390 








Lvs 


Ara 


Met 


Leu 


Asn 
405 


Leu 


Leu 


Leu 


Glu 


Val 


Met 


Arg 


Leu 


Leu Lys 


Met 


Arg Asn 








420 










425 


Pro 


Glv 


Leu 
435 


Leu 


He 


Cys 


Leu 


He 
440 


Val 


Gin 


Lvs 
450 


Asn 


Met 


Pro 


Tyr 


Ala 
455 


Leu 


Thr 


Leu 


Pro 


Leu 


Pro 


Ser 


Glu 


Ala 


Lys 


Gin 


465 










470 








Ser 


Pro 


Thr Cys 


Ser 


Glu 


He 


Leu 


Gin 










485 










Lys 


Leu 


Leu 


Thr 
500 


Asn 


Ala 


Thr 


Ser 


Gin 
505 


Asn 


Leu 


Glu 
515 


He 


Lys 


Ser 


He 


Lys 
520 


Cys 



<210> 369 
<211> 177 
<212> PRT 
<213> E. Coli 





<400> 


369 












Met 


Asn 


Thr 


Lys 


He Arg 


Tyr 


Gly Leu 


1 








5 










He 


Gly 


Ala 


Gly 


Ala 


Ser 


Ala 


Pro 


Gin 








20 










25 


Glu 


Lys 


Glu 


Gly 


Asn 


His 


Thr 


Met 


Ala 






35 










40 




Trp 


Thr 


He 


Cys 


Arg Gly Ala 


Thr 


Val 




50 










55 






Pro 


Asn 


Met 


Lys 


Leu 


Ser 


Lys 


Glu 


Lys 


65 










70 








Glu Arg 


Asp 


Lys 


Ala 


Leu 


Ala 


Trp Val 










85 










Leu 


Thr 


Glu 


Pro 


Gin 


Lys 


Ala 


Gly 


lie 








100 










105 


He 


Gly 


Pro 


Gly 


Lys 


Cys 


Phe 


Pro 


Ser 






115 










120 




Ala 


Gly 


Asp Arg 


Lys 


Gly Ala 


Cys 


Glu 




130 










135 







PCT/US00/02200 

lie Glu Arg Asp Lys Ala Leu 
270 

Pro Leu Thr Glu Pro Gin Lys 
285 

Asn He Gly Pro Gly Lys Cys 
300 

Asn Ala Gly Asd Arg Lys Gly 
315 " 320 

Lys Asp Gly Gly Arg Asp Cys 
330 335 
Gin Val He Arg Arg Asp Gin 
350 

Glu Gin He Arg Tyr Ser Trp 
365 

Ser Glu Met Ser Gly Ala Thr 
380 

Val Met Leu Pro His Tyr His 
395 400 
Leu Asn Arg Gly Glu Leu Pro 
410 415 
Arg Asn Leu Leu Lys Phe Leu 
430 

Leu Thr Ser Cys Val Pro Lys 
445 

Gin Arg Ser lie Pro Gin He 
4 60 

Pro Lys Pro Pro Lys Glu Cys 
475 480 
Gin Lys Leu Ser Phe Met Leu 
490 495 
Glu Leu Val Asn Arg Ser Met 
510 



Ser Ala Ala Val Leu Ala Leu 
10 15 
lie Leu Asp Gin Phe Leu Asp 
30 

Tyr Arg Asp Gly Ser Gly He 
45 

Val Asp Gly Lys Thr Val Phe 
60 

Cys Asp Gin Val Asn Ala He 

75 80 
Glu Arg Asn lie Lys Val Pro 
90 95 
Ala Ser Phe Cys Pro Tyr Asn 
110 

Thr Phe Tyr Lys Arg Leu Asn 
125 

Ala lie Arg Trp Trp lie Lys 
14 0 
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Asp Gly Gly Arg Asp Cys Arg He Arg Ser Asn Asn Cys Tyr Gly Gin 
145 150 155 160 

Val He Arg Arg Asp Gin Glu Ser Ala Leu Thr Cys Trp Gly He Glu 
165 170 175 

. Gin 



<210> 370 
<211> 103 
<212> PRT 
<213> E. Coli 





<400> 


370 














Met 


Thr 


Gin Asp 


Tyr 


Glu 


Leu 


Val 


Val 


Lys Gly Val Arg Asn Phe Glu 


1 








5 










10 15 


Asn 


Lys 


Val 


Thr 
20 


Val 


Thr 


Val 


Ala 


Leu 
25 


Gin Asp Lys Glu Arg Phe Asp 
30 


Gly 


Glu 


He 


Phe 


Asp Leu 


Asp Val Ala 


Met Asp Arg Val Glu Gly Ala 






35 










40 




45 


Ala 


Leu 
50 


Glu 


Phe 


Tyr 


Glu 


Ala 

55 


Ala 


Ala 


Arg Arg Ser Val Arg Gin Val 
60 


Phe 


Leu 


Glu 


Val 


Ala 


Glu 


Lys 


Leu 


Ser 


Glu Lys Val Glu Ser Tyr Leu 


65 










70 








75 80 


Gin 


His 


Gin 


Tyr 


Ser 
85 


Phe 


Lys 


He 


Glu 


Asn Pro Ala Asn Lys His Glu 
90 95 


Arg 


Pro 


His 


His 
100 


Lys 


Tyr 


Leu 









<210> 371 
<211> 96 
<212> PRT 
<213> E. Coli 





<400> 


371 








Met 
1 


Leu 


Ser 


Lys 


Leu 

5 


Pro Arg Arg 


Thr 


He 


Arg 


Val 
20 


His 


Arg Gly 


Glu 


Thr 


Thr 


Gly 
35 


Val 


Ala 


Tyr Gly 


Thr 
40 


Ala 


Leu 
50 


Gin 


Leu 


Leu Asp Lys 
55 


Val 


Gly Val 
65 


Leu 


Gly 


Ser 


Leu Val 
70 


Phe 


Leu 


Tyr 


Phe 


Lys 


He 


Lys Glu 


Asp 



85 



Leu Arg Ser Phe Gin Thr Tyr Cys 

10 15 
Asp Met Lys Ser Met Asp Lys Leu 
25 30 
Ser Ala Gly Asn Ala Gly Phe Trp 
45 

Thr Pro Ser Gin Trp Ala Ala He 
60 

Gly Leu Leu Thr Tyr Leu Thr Asn 

75 80 
Arg Arg Lys Ala Ala Arg Gly Glu 
90 95 



<210> 372 
<211> 71 
<212> PRT 
<213> E. Coli 

<400> 372 
Met Ser Asn Lys Met Thr Gly Leu 

1 5 
Gly Phe Gly Phe He Ser Pro Val 



Val Lys Trp Phe Asn Ala Asp Lys 

10 15 
Asp Gly Ser Lys Asp Val Phe Val 
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20 25 30 

His Phe Ser Ala He Gin Asn Asp Asn Tyr Arg Thr Leu Phe Glu Gly 

35 40 45 

Gin Lys Val Thr Phe Ser He Glu Ser Gly Ala Lys Gly Pro Ala Ala 

50 55 60 

Ala Asn Val He He Thr Asp 
65 70 



<210> 373 
<211> 338 
<212> PRT 
<213> E. Coli 

<400> 373 

Met Phe Val He Trp Ser His Arg Thr Gly Phe He Met Ser His Gin 

15 10 15 

Leu Thr Phe Ala Asp Ser Glu Phe Ser Ser Lys Arg Arg Gin Thr Arg 

20 25 30 

Lys Glu He Phe Leu Ser Arg Met Glu Gin lie Leu Pro Trp Gin Asn 

35 40 45 

Met Val Glu Val He Glu Pro Phe Tyr Pro Lys Ala Gly Asn Gly Arq 

50 55 60 

Arg Pro Tyr Pro Leu Glu Thr Met Leu Arg He His Cys Met Gin His 
65 70 75 - 80 

Trp Tyr Asn Leu Ser Asp Gly Ala Met Glu Asp Ala Leu Tyr Glu He 

85 90 95 

Ala Ser Met Arg Leu Phe Ala Arg Leu Ser Leu Asp Ser Ala Leu Pro 

100 105 no 

Asp Arg Thr Thr He Met Asn Phe Arg His Leu Leu Glu Gin His Gin 

115 120 125 

Leu Ala Arg Gin Leu Phe Lys Thr He Asn Arg Trp Leu Ala Glu Ala 

130 135 140 

Gly Val Met Met Thr Gin Gly Thr Leu Val Asp Ala Thr He lie Glu 
1^5 150 155 160 

Ala Pro Ser Ser Thr Lys Asn Lys Glu Gin Gin Arg Asp Pro Glu Met 

165 170 175 

His Gin Thr Lys Lys Gly Asn Gin Trp His Phe Gly Met Lys Ala His 

180 185 ' 190 

He Gly Val Asp Ala Lys Ser Gly Leu Thr His Ser Leu Val Thr Thr 

195 200 205 

Ala Ala Asn Glu His Asp Leu Asn Gin Leu Gly Asn Leu Leu His Gly 

210 215 220 

Glu Glu Gin Phe Val Ser Ala Asp Ala Gly Tyr Gin Gly Ala Pro Gin 
225 230 235 2 40 

Arg Glu Glu Leu Ala Glu Val Asp Val Asp Trp Leu He Ala Glu Arg 

245 250 255 

Pro Gly Lys Val Arg Thr Leu Lys Gin His Pro Arg Lys Asn Lys Thr 

260 265 270 

Ala He Asn lie Glu Tyr Met Lys Ala Ser He Arg Ala Arg Val Glu 

275 280 285 

His Pro Phe Arg lie lie Lys Arg Gin Phe Gly Phe Val Lys Ala Arg 

290 295 300 

Tyr Lys Gly Leu Leu Lys Asn Asp Asn Gin Leu Ala Met Leu Phe Thr 
305 310 315 320 

Leu Ala Asn Leu Phe Arg Ala Asp Gin Met lie Arg Gin Trp Glu Arg 
325 330 335 

Ser His 
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<210> 374 
<211> 157 
<212> PRT 
<213> E. Coli 

<400> 374 

Met Val Tyr He He lie Val Ser His Gly His Glu Asp Tyr He Lys 

15 10 15 

Lys Leu Leu Glu Asn Leu Asn Ala Asp Asp Glu His Tyr Lys He He 

20 25 30 

Val Arg Asp Asn Lys Asp Ser Leu Leu Leu Lys Gin He Cys Gin His 

35 40 45 

Tyr Ala Gly Leu Asp Tyr He Ser Gly Gly Val Tyr Gly Phe Gly His 

50 55 60 

Asn Asn Asn He Ala Val Ala Tyr Val Lys Glu Lys Tyr Arg Pro Ala 
65 ? 0 75 ^ 80 

Asp Asp Asp Tyr He Leu Phe Leu Asn Pro Asp He He Met Lys His 

85 90 95 

Asp Asp Leu Leu Thr Tyr He Lys Tyr Val Glu Ser Lys Arg Tyr Ala 

100 105 no 

Phe Ser Thr Leu Cys Leu Phe Arg Asp Glu Ala Lys Ser Leu His Asp 

115 120 125 

Tyr Ser Val Arg Lys Phe Pro Val Leu Ser Asp Phe He Val Ser Phe 

130 135 140 

Met Leu Gly He Lys Glu Gly Ala Asn Lys Ser Leu He 
145 150 155 



<210> 375 
<211> 372 
<212> PRT 
<213> E. Coli 

<400> 375 

Met Gly Lys Ser He Val Val Val Ser Ala Val Asn Phe Thr Thr Glv 

1 5 io 15 

Gly Pro Phe Thr He Leu Lys Lys Phe Leu Ala Ala Thr Asn Asn Lvs 

20 25 30 

Glu Asn Val Ser Phe He Ala Leu Val His Ser Ala Lys Glu Leu Lys 

35 40 45 

Glu Ser Tyr Pro Trp Val Lys Phe He Glu Phe Pro Glu Val Lys Glv 

50 55 60 

Ser Trp Leu Lys Arg Leu His Phe Glu Tyr Val Val Cys Lys Lys Leu 
65 7 0 75 80 

Ser Lys Glu Leu Asn Ala Thr His Trp He Cys Leu His Asp He Thr 

85 90 95 

Ala Asn Val Val Thr Lys Lys Arg Tyr Val Tyr Cys His Asn Pro Ala 

100 105 no 

Pro Phe Tyr Lys Gly He Leu Phe Arg Glu He Leu Met Glu Pro Ser 

115 120 125 

Phe Phe Leu Phe Lys Met Leu Tyr Gly Leu He Tyr Lys He Asn He 

130 135 140 

Lys Lys Asn Thr Ala Val Phe Val Gin Gin Phe Trp Met Lys Glu Lys 
145 150 155 160 

Phe He Lys Lys Tyr Ser He Asn Asn He He Val Ser Arg Pro Glu 

165 170 175 

He Lys Leu Ser Asp Lys Ser Gin Leu Thr Asp Asp Asp Ser Gin Phe 

180 185 190 

Lys Asn Asn Pro Ser Glu Leu Thr lie Phe Tyr Pro Ala Val Pro Arg 
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195 200 205 

Val Phe Lys Asn Tyr Glu Leu He He Ser Ala Ala Arg Lys Leu Lvs 

210 215 220 

Glu Gin Ser Asn He Lys Phe Leu Leu Thr He Ser Gly Thr Glu Asn 
225 230 235 ' 240 

Ala Tyr Ala Lys Tyr He lie Ser Leu Ala Glu Gly Leu Asp Asn Val 

245 250 ' 255 

His Phe Leu Gly Tyr Leu Asp Lys Glu Lys lie Asp His Cys Tyr Asn 

260 265 270 

lie Ser Asp lie Val Cys Phe Pro Ser Arg Leu Glu Thr Trp Gly Leu 

275 280 " 285 

Pro Leu Ser Glu Ala Lys Glu Arg Gly Lys Trp Val Leu Ala Ser Asp 

290 295 300 

Phe Pro Phe Thr Arg Glu Thr Leu Gly Ser Tyr Glu Lys Lys Ala Phe 
305 310 315 * J 320 

Phe Asp Ser Asn Asn Asp Asp Met Leu Val Lys Leu He He Asp Phe 

325 330 335 

Lys Lys Gly Asn Leu Lys Lys Asp lie Ser Asp Ala Asn Phe lie Tyr 

340 345 350 

Arg Asn Glu Asn Val Leu Val Gly Phe Asp Glu Leu Val Asn Phe He 

355 360 365 

Thr Glu Glu His 
370 



<210> 376 
<211> 196 
<212> PRT 
<213> E. Coli 

<400> 376 

Met lie Leu Lys Leu Ala Lys Arg Tyr Gly Leu Cys Gly Phe lie Arg 

1 5 10 15 

Leu Val Arg Asp Val Leu Leu Thr Arg Val Phe Tyr Arg Asn Cys Arq 

20 25 30 

lie lie Arg Phe Pro Cys Tyr lie Arg Asn Asp Gly Ser He Asn Phe 

35 40 45 

Gly Glu Asn Phe Thr Ser Gly Val Gly Leu Arg Leu Asp Ala Phe Gly 

50 55 60 

Arg Gly Val lie Phe Phe Ser Asp Asn Val Gin Val Asn Asp Tyr Val 
6 $ 70 75 80 

His lie Ala Ser lie Glu Ser Val Thr lie Gly Arg Asp Thr Leu lie 

85 90 95 

Ala Ser Lys Val Phe lie Thr Asp His Asn His Gly Ser Phe Lys His 

100 105 HO 

Ser Asp Pro Met Ser Ser Pro Asn He Pro Pro Asp Met Arg Thr Leu 

115 120 125 

Glu Ser Ser Ala Val Val lie Gly Gin Arg Val Trp Leu Gly Glu Asn 

130 135 140 

Val Thr Val Leu Pro Gly Thr lie lie Gly Asn Gly Val Val Val Gly 
145 150 155 160 

Ala Asn Ser Val Val Arg Gly Ser lie Pro Glu Asn Thr Val lie Ala 

165 170 175 

Gly Val Pro Ala Lys lie lie Lys Lys Tyr Asn His Glu Thr Lys Leu 

180 185 190 

Trp Glu Lys Ala 
195 

<210> 377 
<211> 330 
<212> PRT 
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<213> E. Coli 
<400> 377 

Met Tyr Phe Leu Asn Asp Leu Asn Phe Ser Arg Arg Asp Ala Gly Phe 

1 5 10 1 15 

Lys Ala Arg Lys Asp Ala Leu Asp He Ala Ser Asp Tyr Glu Asn He 

20 25 30 

Ser Val Val Asn He Pro Leu Trp Gly Gly Val Val Gin Arg He He 

35 40 45 

Ser Ser Val Lys Leu Ser Thr Phe Leu Cys Gly Leu Glu Asn Lys Asd 

50 55 60 

Val Leu He Phe Asn Phe Pro Met Ala Lys Pro Phe Trp His He Leu 
65 7 0 75 80 

Ser Phe Phe His Arg Leu Leu Lys Phe Arg He Val Pro Leu lie His 

85 90 95 

Asp He Asp Glu Leu Arg Gly Gly Gly Gly Ser Asp Ser Val Arg Leu 

100 105 no 

Ala Thr Cys Asp Met Val He Ser His Asn Pro Gin Met Thr Lys Tvr 

115 120 125 

Leu Ser Lys Tyr Met Ser Gin Asp Lys He Lys Asp He Lys He Phe 

130 135 140 

Asp Tyr Leu Val Ser Ser Asp Val Glu His Arg Asp Val Thr Asp Lys 
145 ISO 155 160 

Gin Arg Gly Val He Tyr Ala Gly Asn Leu Ser Arg His Lys Cys Ser 

165 170 175 

Phe He Tyr Thr Glu Gly Cys Asp Phe Thr Leu Phe Gly Val Asn Tyr 

180 185 190 

Glu Asn Lys Asp Asn Pro Lys Tyr Leu Gly Ser Phe Asp Ala Gin Ser 

195 200 205 

Pro Glu Lys He Asn Leu Pro Gly Met Gin Phe Gly Leu He Trp Asp 

210 215 220 

Gly Asp Ser Val Glu Thr Cys Ser Gly Ala Phe Gly Asp Tyr Leu Lys 
225 230 235 ' 240 

Phe Asn Asn Pro His Lys Thr Ser Leu Tyr Leu Ser Met Glu Leu Pro 

245 250 255 

Val Phe He Trp Asp Lys Ala Ala Leu Ala Asp Phe He Val Asp Asn 

260 265 270 

Arg He Gly Tyr Ala Val Gly Ser He Lys Glu Met Gin Glu He Val 

275 280 285 

Asp Ser Met Thr He Glu Thr Tyr Lys Gin He Ser Glu Asn Thr Lys 

290 295 300 

He He Ser Gin Lys He Arg Thr Gly Ser Tyr Phe Arg Asp Val Leu 
305 310 315 ' ' 320 

Glu Glu Val He Asp Asp Leu Lys Thr Arg 
325 330 



<210> 378 

<211> 388 

<212> PRT 

<213> E. Coli 

<400> 378 

Met He Tyr Leu Val lie Ser Val 

1 5 

Leu Tyr Leu Lys Lys Asd He Phe 
20 

He Phe Ala Leu Val Leu Leu Gly 

35 40 

Ala Phe Gin Leu Asn Asd Ala Thr 
50 55 



Phe Leu He Thr Ala Phe lie Cys 

10 15 
Tyr Pro Ala Val Cys Val Asn lie 
25 30 
Tyr Glu He Thr Ser Asp lie Tyr 
45 

Leu He Phe Leu Leu Cys Asn Val 
60 
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Leu Thr Phe Thr Leu Ser Cys Leu Leu Thr Glu Ser Val Leu Asp Leu 
65 70 75 80 

Asn He Arg Lys Val Asn Asn Ala He Tyr Ser He Pro Ser Lys Lys 

85 90 95 

Val His Asn Val Gly Leu Leu Val He Ser Phe Ser Met He Tyr He 

100 105 no 

Cys Met Arg Leu Ser Asn Tyr Gin Phe Gly Thr Ser Leu Leu Ser Tyr 

115 120 125 

Met Asn Leu He Arg Asp Ala Asp Val Glu Asp Thr Ser Arg Asn Phe 

130 135 140 

Ser Ala Tyr Met Gin Pro He He Leu Thr Thr Phe Ala Leu Phe He 
145 150 155 160 

Trp Ser Lys Lys Phe Thr Asn Thr Lys Val Ser Lys Thr Phe Thr Leu 

165 170 175 

Leu Val Phe He Val Phe He Phe Ala He He Leu Asn Thr Gly Lys 

180 185 190 

Gin He Val Phe Met Val He He Ser Tyr Ala Phe He Val Gly Val 

195 200 205 

Asn Arg Val Lys His Tyr Val Tyr Leu lie Thr Ala Val Gly Val Leu 

210 215 220 

Phe Ser Leu Tyr Met Leu Phe Leu Arg Gly Leu Pro Gly Gly Met Ala 
225 230 235 240 

Tyr Tyr Leu Ser Met Tyr Leu Val Ser Pro lie lie Ala Phe Gin Glu 

245 250 255 

Phe Tyr Phe Gin Gin Val Ser Asn Ser Ala Ser Ser His Val Phe Trp 

260 265 270 

Phe Phe Glu Arg Leu Met Gly Leu Leu Thr Gly Gly Val Ser Met Ser 

275 280 285 

Leu His Lys Glu Phe Val Trp Val Gly Leu Pro Thr Asn Val Tyr Thr 

290 295 300 

Ala Phe Ser Asp Tyr Val Tyr lie Ser Ala Glu Leu Ser Tyr Leu Met 
305 310 315 320 

Met Val lie His Gly Cys lie Ser Gly Val Leu Trp Arg Leu Ser Arg 

325 330 " 335 

Asn Tyr lie Ser Val Lys lie Phe Tyr Ser Tyr Phe lie Tyr Thr Phe 

340 345 350 

Ser Phe lie Phe Tyr His Glu Ser Phe Met Thr Asn lie Ser Ser Trp 

355 360 365 

lie Gin lie Thr Leu Cys lie lie Val Phe Ser Gin Phe Leu Lys ALa 

370 375 380 

Gin Lys lie Lys 
385 



<210> 379 
<211> 367 
<212> PRT 
<213> E. Coli 

<400> 379 
Met Tyr Asp Tyr lie lie Val Gly 

1 5 
Ala Asn Glu Leu Lys Lys Leu Asn 
20 

Arg Asn His lie Gly Gly Asn Ala 

35 - 40 

Gin lie His Lys Tyr Gly Ala His 

50 55 
lie Trp Asp Tyr Val Asn Asp Leu 
65 70 



Ser Gly Leu Phe Gly Ala Val Cys 

10 15 
Lys Lys Val Leu Val lie Glu Lys 
25 30 
Tyr Thr Glu Asp Cys Glu Gly lie 
4 5 

lie Phe His Thr Asn Asp Lys Tyr 
60 

Val Glu Phe Asn Arg Phe Thr Asn 
75 80 
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Ser Pro Leu Ala He Tyr Lys Asp Lys Leu Phe Asn Leu Pro Phe Asn 

85 90 95 

Met Asn Thr *Phe His Gin Met Trp Gly Val Lys Asd Pro Gin Glu Ala 

100 105 no 

.Gin Asn He He Asn Ala Gin Lys Lys Lys Tyr Gly Asp Lys Val Pro 

115 120 125 

Glu Asn Leu Glu Glu Gin Ala He Ser Leu Val Gly Glu Asp Leu Tyr 

130 135 140 

Gin Ala Leu He Lys Gly Tyr Thr Glu Lys Gin Trp Gly Arg Ser Ala 
145 150 155 160 

Lys Glu Leu Pro Ala Phe lie lie Lys Arg He Pro Val Arg Phe Thr 

165 170 175 

Phe Asp Asn Asn Tyr Phe Ser Asp Arg Tyr Gin Gly He Pro Val Gly 

180 185 ' 190 

Gly Tyr Thr Lys Leu lie Glu Lys Met Leu Glu Gly Val Asp Val Lys 

195 200 205 

Leu Gly lie Asp Phe Leu Lys Asp Lys Asp Ser Leu Ala Ser Lys Ala 

210 215 220 

His Arg lie lie Tyr Thr Gly Pro lie Asp Gin Tyr Phe Asp Tyr Arq 
225 230 235 240 

Phe Gly Ala Leu Glu Tyr Arg Ser Leu Lys Phe Glu Thr Glu Arg His 

245 250 255 

Glu Phe Pro Asn Phe Gin Gly Asn Ala Val lie Asn Phe Thr Asp Ala 

260 265 270^ 

Asn Val Pro Tyr Thr Arg lie lie Glu His Lys His Phe Asp Tyr Val 

275 280 285 

Glu Thr Lys His Thr Val Val Thr Lys Glu Tyr Pro Leu Glu Trp Lys 

290 295 300 

Val Gly Asp Glu Pro Tyr Tyr Pro Val Asn Asp Asn Lys Asn Met Glu 
305 310 315 320 

Leu Phe Lys Lys Tyr Arg Glu Leu Ala Ser Arg Glu Asp Lys Val lie 

325 330 335 

Phe Gly Gly Arg Leu Ala Glu Tyr Lys Tyr Tyr Asp Met His Gin Val 

340 345 350 

He Ser Ala Ala Leu Tyr Gin Val Lys Asn lie Met Ser Thr Asp 
355 360 365 



<210> 380 
<211> 371 
<212> PRT 
<213> E. Coli 

<400> 380 

Met Phe Pro Lys lie Met Asn Asp Glu Asn Phe Phe Lys Lys Ala Ala 

1 5 10 15 

Ala His Gly Glu Glu Pro Pro Leu Thr Pro Gin Asn Glu His Gin Arg 

20 25 30 

Ser Gly Leu Arg Phe Ala Arg Arg Val Arg Leu Pro Arg Ala Val Gly 

35 40 45 

Leu Ala Gly Met Phe Leu Pro lie Ala Ser Thr Leu Val Ser His Pro 

50 55 60 

Pro Pro Gly Trp Trp Trp Leu Val Leu Val Gly Trp Ala Phe Val Trp 
65 70 75 80 

Pro His Leu Ala Trp Gin lie Ala Ser Arg Ala Val Asp Pro Leu Ser 

85 90 95 

Arg Glu lie Tyr Asn Leu Lys Thr Asp Ala Val Leu Ala Gly Met Trp 

100 105 HO 

Val Gly Val Met Gly Val Asn Val Leu Pro Ser Thr Ala Met Leu Met 

115 120 125 

lie Met Cys Leu Asn Leu Met Gly Ala Gly Gly Pro Arg Leu Phe Val 



-192- 



WO 00/44906 



PCT/USOO/02200 



130 135 140 

Ala Gly Leu Val Leu Met Val Val Ser Cys Leu Val Thr Leu Glu Leu 
145 150 155 160 

Thr Gly He Thr Val Ser Phe Asn Ser Ala Pro Leu Glu Trp Trp Leu 

165 170 175 

Ser Leu Pro He He Val He Tyr Pro Leu Leu Phe Gly Trp Val Ser 

180 185 190 

Tyr Gin Thr Ala Thr Lys Leu Ala Glu His Lys Arg Arg Leu Gin Val 

195 200 205 

Met Ser Thr Arg Asp Gly Met Thr Gly Val Tyr Asn Arg Arg His Trp 

210 215 220 

Glu Thr Met Leu Arg Asn Glu Phe Asp Asn Cys Arg Arg His Asn Arg 
225 230 235 240 

Asp Ala Thr Leu Leu He He Asp He Asp His Phe Lys Ser He Asn 

245 250 " 255 

Asp Thr Trp Gly His Asp Val Gly Asp Glu Ala He Val Ala Leu Thr 

260 265 270 

Arg Gin Leu Gin He Thr Leu Arg Gly Ser Asp Val He Gly Arg Phe 

275 280 * 285 

Gly Gly Asp Glu Phe Ala Val He Met Ser Gly Thr Pro Ala Glu Ser 

290 295 300 

Ala He Thr Ala Met Leu Arg Val His Glu Gly Leu Asn Thr Leu Arq 
305 310 315 320 

Leu Pro Asn Thr Pro Gin Val Thr Leu Arg He Ser Val Gly Val Ala 

325 330 335 

Pro Leu Asn Pro Gin Met Ser His Tyr Arg Glu Trp Leu Lys Ser Ala 

340 345 350 

Asp Leu Ala Leu Tyr Lys Ala Lys Lys Ala Gly Arg Asn Arg Thr Glu 

355 360 365 

Val Ala Ala 
370 

<210> 381 
<211> 467 
<212> PRT 
<213> £. Coli 

<400> 381 

Met Asp Val Asn Val Asp Gin Phe Asp Thr Glu Ala Phe Arg Thr Asp 

1 5 10 15 

Lys Leu Glu Leu Thr Ser Gly Asn He Ala Asp His Asn Gly Asn Val 

20 25 " 30 

Val Ser Gly Val Phe Asp He His Ser Ser Asp Tyr Val Leu Asn Ala 

35 40 45 

Asp Leu Val Asn Asp Arg Thr Trp Asp Thr Ser Lys Ser Asn Tyr Gly 

50 55 60 

Tyr Gly He Val Ala Met Asn Ser Asp Gly His Leu Thr He Asn Gly 
65 70 75 80 

Asn Gly Asp Val Asp Asn Gly Thr Glu Leu Asp Asn Ser Ser Val Asp 

85 90 95 

Asn Val Val Ala Ala Thr Gly Asn Tyr Lys Val Arg He Asp Asn Ala 

100 105 no 

Thr Gly Ala Gly Ala He Ala Asp Tyr Lys Asp Lys Glu He He Tyr 

115 120 125 

Val Asn Asp Val Asn Ser Asn Ala Thr Phe Ser Ala Ala Asn Lys Ala 

130 135 140 

Asp Leu Gly Ala Tyr Thr Tyr Gin Ala Glu Gin Arg Gly Asn Thr Val 
145 150 155 160 

Val Leu Gin Gin Met Glu Leu Thr Asp Tyr Ala Asn Met Ala Leu Ser 

165 170 175 

He Pro Ser Ala Asn Thr Asn He Trp Asn Leu Glu Gin Asp Thr Val 



-193- 



WO 00/44906 



PCT/USOO/02200 



180 185 
Gly Thr Arg Leu Thr Asn Ser Arg His 

195 200 
Ala Trp Val Ser Tyr Phe Gly Gly Asn 

210 215 
lie Asn Tyr Asp Gin Asp Val Asn Gly 
225 230 
Lys lie Asp Gly Asn Asn Ala Lys Trp 
245 

Phe Ala Lys Gly Asp Met Asn Asp Arg 
260 265 
Ser Gin Thr Ala Tyr He Tyr Ser Ser 

275 280 
Phe Val Asp Gly Ser Leu Ser Tyr Ser 

290 295 
Ala Thr Met Ser Asn Gly Thr Tyr Val 
305 310 
Ala Trp Gly Phe Gly Leu Lys Ala Gly 
325 

Ala Gly Tyr Val Thr Pro Tyr Gly Ser 
340 345 
Gly Asp Asp Tyr Gin Leu Ser Asn Asp 

355 360 
Tyr Asp Ser Met Arg Tyr Glu Leu Gly 

370 375 
Thr Tyr Ser Glu Asp Gin Ala Leu Thr 
385 390 
Val Tyr Asp Asp Ser Asn Asn Asp Asn 
405 

Asp Asn Gly Thr Glu Gly Ser Ala Val 
420 425 
Phe Ser Phe Thr Lys Asn Phe Ser Ala 

435 440 
Gly Gly Gly Asp Val Asp Gin Asp Trp 

450 455 
Tyr Thr Trp 
4 65 



190 

Gly Leu Ala Asd Asn Gly Gly 
205 

Phe Asn Gly Asp Asn Gly Thr 
220 

He Met Val Gly Val Asp Thr 
235 240 
He Val Gly Ala Ala Ala Gly 
250 255 
Ser Gly Gin Val Asp Gin Asp 
270 

Ala His Phe Ala Asn Asn Val 
285 

His Phe Asn Asn Asp Leu Ser 
300 

Asp Gly Ser Thr Asn Ser Asp 
315 320 
Tyr Asp Phe Lys Leu Gly Asp 
330 335 
Val Ser Gly Leu Phe Gin Ser 
350 

Met Lys Val Asp Gly Gin Ser 
365 

Val Asp Ala Gly Tyr Thr Phe 
380 

Pro Tyr Phe Lys Leu Ala Tyr 
395 400 
Asp Val Asn Gly Asp Ser He 
410 415 
Arg Val Gly Leu Gly Thr Gin 
430 

Tyr Thr Asp Ala Asn Tyr Leu 
445 

Ser Ala Asn Val Gly Val Lys 
4 60 



<210> 382 
<211> 222 
<212> PRT 
<213> E. Coli 





<400> 


382 




















Met 


Pro 


Val 


Lys 


Asp 


Leu Thr Gly 


He 


Thr 


Ala Lys Asp 


Ala 


Gin 


Met 


1 








5 








10 






15 




Leu 


Ser 


Val 


Val 


Lys 


Pro 


Leu Gin 


Glu 


Phe 


Gly Lys Leu 


Asp 


Lys 


Cys 








20 








25 






30 




Leu 


Ser 


Arg 


Tyr 


Gly Thr 


Arg Phe 


Glu 


Phe 


Asn Asn Glu 


Lys 


Gin 


Val 






35 








40 






45 






He 


Phe 


Ser 


Ser 


Asp 


Val 


Asn Asn 


Glu 


Asp 


Thr Phe Val 


He 


Leu 


Glu 




50 










55 




60 








Gly Val 


lie 


Ser 


Leu 


Arg Arg Glu 


Glu 


Asn 


Val Leu He Gly He Thr 


65 










70 








75 






80 


Gin 


Ala 


Pro 


Tyr 


He 


Met 


Gly Leu Ala 


Asp Gly Leu Met 


Lys 


Asn 


Asp 










85 








90 






95 


He 


Pro 


Tyr 


Lys 


Leu 


He 


Ser Glu 


Gly 


Asn 


Cys Thr Gly Tyr 


His 


Leu 








100 








105 






110 






Pro 


Ala 


Lys 


Gin 


Thr 


He 


Thr Leu 


He 


Glu 


Gin Asn Gin 


Leu Trp Arg 






115 








120 






125 
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Asp Ala 


Phe 


Tyr 


Trp 


Leu 


Ala Trp Gin 


130 










135 


Asp Val 


Gin 


Leu 


He 


Gly 


His Asn Ser 


145 








150 




Leu Leu 


Ser 


Met 


He Asp 


Trp Asn Glu 








165 






Val Met 


Asn 


Tyr 


He 


His 


Gin Arg Thr 






180 






185 


Ala Glu 


Val 


Leu 


Ala 


Ala 


Leu Arg Lys 




195 








200 


Lys Gly 


Lys 


Leu 


Val 


Ala 


He Asn Arg 


210 










215 



PCT/USOO/02200 



Asn Arg He Leu Glu Leu Arg 
140 

Tyr Glu Gin He Arg Ala Thr 
155 160 
Glu Leu Arg Ser Arg He Gly 
170 175 
Arg He Ser Arg Ser Val Val 
190 

Gly Gly Tyr He Glu Met Asn 
205 

Leu Pro Ser Glu Tyr 
220 



<210> 383 
<211> 84 
<212> PRT 
<213> E . Coli 

<400> 383 

Met Thr Asp Lys He Arg Thr Leu Gin Gly Arg Val Val Ser Asp Lys 

1 5 10 15 

Met Glu Lys Ser He Val Val Ala lie Glu Arg Phe Val Lys His Pro 

20 25 30 

He Tyr Gly Lys Phe He Lys Arg Thr Thr Lys Leu His Val His Asp 

35 40 4 5 

Glu Asn Asn Glu Cys Gly He Gly Asp Val Val Glu He Arg Glu Cys 

50 55 60 

Arg Pro Leu Ser Lys Thr Lys Ser Trp Thr Leu Val Arg Val Val Glu 
65 70 75 80 

Lys Ala Val Leu 



<210> 384 
<211> 63 
<212> PRT 
<213> E. Coli 

<400> 384 

Met Lys Ala Lys Glu Leu Arg Glu Lys Ser Val Glu Glu Leu Asn Thr 

15 10 15 

Glu Leu Leu Asn Leu Leu Arg Glu Gin Phe Asn Leu Arg Met Gin Ala 

20 25 30 

Ala Ser Gly Gin Leu Gin Gin Ser His Leu Leu Lys Gin Val Arg Arg 

35 40 45 

Asp Val Ala Arg Val Lys Thr Leu Leu Asn Glu Lys Ala Gly Ala 
50 55 60 



<210> 385 
<211> 136 
<212> PRT 
<213> E. Coli 

<400> 385 
Met Leu Gin Pro Lys Arg Thr Lys 

1 5 
Asn Arg Gly Leu Ala Gin Gly Thr 
20 

Leu Lys Ala Val Gly Arg Gly Arg 



Phe Arg Lys Met His Lys Gly Arg 

10 15 
Asp Val Ser Phe Gly Ser Phe Gly 
25 30 
Leu Thr Ala Arg Gin He Glu Ala 



-195- 



WO 00/44906 



PCTAJSOO/02200 













40 


Ala Arg 


Arg Ala 


Met 


Thr Arg Ala 












55 


j.±e 


Arg 


vai rne 


Pro Asp 


Lys Pro 










70 




Arg 


Met 


Gly Lys 


Gly 


Lys 


Gly Asn 








85 






Gin 


Pro 


Gly Lys 


Val 


Leu 


Tyr Glu 






100 








Ala 


Arg 


Glu Ala 


Phe 


Lys 


Leu Ala 






115 






120 


Thr 


Phe 


Val Thr 


Lys 


Thr 


Val Met 




130 








135 



45 

Val Lys Arg Gin Gly Lys He Trp 
60 

He Thr Glu Lys Pro Leu Ala Val 

"?5 80 
Val Glu Tyr Trp Val Ala Leu He 

90 95 
Met Asp Gly Val Pro Glu Glu Leu 
105 no 
Ala Ala Lys Leu Pro He Lys Thr 
125 



<210> 386 
<211> 233 
<212> PRT 
<213> E. Coli 

<400> 386 

Met Gly Gin Lys Val His Pro Asn Gly He Arg Leu Gly He Val Lvs 

1 5 10 15 

Pro Trp Asn Ser Thr Trp Phe Ala Asn Thr Lys Glu Phe Ala Asp Asn 

20 25 30 

Leu Asp Ser Asp Phe Lys Val Arg Gin Tyr Leu Thr Lys Glu Leu Ala 

35 40 45 

Lys Ala Ser Val Ser Arg He Val He Glu Arg Pro Ala Lys Ser He 

50 55 60 

Arg Val Thr He His Thr Ala Arg Pro Gly He Val He Gly Lys Lys 
65 70 75 80 

Gly Glu Asp Val Glu Lys Leu Arg Lys Val Val Ala Asp He Ala Gly 

85 90 95 

Val Pro Ala Gin He Asn He Ala Glu Val Arg Lys Pro Glu Leu Asp 

100 105 no 

Ala Lys Leu Val Ala Asp Ser He Thr Ser Gin Leu Glu Arq Ara Val 

115 120 125 

Met Phe Arg Arg Ala Met Lys Arg Ala Val Gin Asn Ala Met Arg Leu 

130 135 140 

Gly Ala Lys Gly He Lys Val Glu Val Ser Gly Arg Leu Gly Gly Ala 
145 150 155 " 160 

Glu He Ala Arg Thr Glu Trp Tyr Arg Glu Gly Arg Val Pro Leu His 

165 170 " 175 

Thr Leu Arg Ala Asp He Asp Tyr Asn Thr Ser Glu Ala His Thr Thr 

180 185 190 

Tyr Gly Val He Gly Val Lys Val Trp He Phe Lys Gly Glu He Leu 

195 200 205 

Gly Gly Met Ala Ala Val Glu Gin Pro Glu Lys Pro Ala Ala Gin Pro 

210 215 220 

Lys Lys Gin Gin Arg Lys Gly Arg Lys 
225 230 



<210> 387 
<211> 110 
<212> PRT 
<213> E. Coli 

<400> 387 
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Met Glu 


Thr 


He 


Ala 


Lys 


His 


Arg 


1 






5 








Val Arg 


Leu 


Val 


Ala Asp Leu 


He 






20 










Leu Asp 


He 
35 


Leu 


Thr 


Tyr 


Thr 


Asn 
40 


Lys Val 


Leu 


Glu 


Ser 


Ala 


He 


Ala 


50 










55 




Asp lie 


Asp Asp 


Leu 


Lys 


Val 


Thr 


65 








70 






Ser Met 


Lys 


Arg 


He 


Met 


Pro Arg 








85 








Leu Lys 


Arg 


Thr 
100 


Ser 


His 


He 


Thr 



His Ala Arg Ser Ser Ala Gin Lys 

10 15 
Arg Gly Lys Lys Val Ser Gin Ala 
25 30 
Lys Lys Ala Ala Val Leu Val Lys 
45 

Asn Ala Glu His Asn Asp Gly Ala 
60 

Lys lie Phe Val Asp Glu Gly Pro 

75 80 
Ala Lys Gly Arg Ala Asp Arg He 

90 95 
Val Val Val Ser Asp Arg 
105 110 



<210> 388 
<211> 92 
<212> PRT 
<213> E. Coli 

<400> 388 
Met Pro Arg Ser Leu Lys Lys Gly 

1 5 
Met Lys Val Glu Lys Ala Val Glu 
20 

Thr Trp Ser Arg Arg Ser Thr He 

35 40 
He Ala Val His Asn Gly Arg Gin 

50 55 
Glu Met Val Gly His Lys Leu Gly 
65 70 
Arg Gly His Ala Ala Asp Lys Lys 
85 



Pro Phe He Asp Leu His Leu Leu 

10 15 
Ser Gly Asp Lys Lys Pro Leu Arg 
25 30 
Phe Pro Asn Met He Gly Leu Thr 
45 

His Val Pro Val Phe Val Thr Asp 
60 

Glu Phe Ala Pro Thr Arg Thr Tyr 

75 80 
Ala Lys Lys Lys 
90 



<210> 389 
<211> 273 
<212> PRT 
<213> E. Coli 





<400> 


389 














Met 


Ala 


Val 


Val 


Lys 


Cys 


Lys 


Pro 


Thr Ser Pro Gly Arg Arg His Val 


1 








5 








10 


15 


Val 


Lys 


Val 


Val 


Asn 


Pro 


Glu 


Leu 


His Lys Gly Lys Pro 


Phe Ala Pro 








20 










25 


30 


Leu 


Leu 


Glu 


Lys 


Asn 


Ser 


Lys 


Ser 


Gly Gly Arg Asn Asn 


Asn Gly Arg 






35 










40 


45 


lie 


Thr 


Thr Arg 


His 


He 


Gly Gly Gly His Lys Gin Ala 


Tyr Arg lie 




50 










55 




60 


Val 


Asp 


Phe 


Lys 


Arg 


Asn 


Lys 


Asp 


Gly lie Pro Ala Val 


Val Glu Arg 


65 










70 






75 


80 


Leu 


Glu 


Tyr Asp 


Pro 


Asn 


Arg 


Ser 


Ala Asn He Ala Leu 


Val Leu Tyr 










85 








90 


95 


Lys 


Asp 


Gly 


Glu 


Arg Arg 


Tyr 


He 


Leu Ala Pro Lys Gly Leu Lys Ala 








100 










105 


110 


Gly Asp 


Gin 


He 


Gin 


Ser 


Gly Val 


Asp Ala Ala He Lys 


Pro Gly Asn 






115 










120 


125 


Thr 


Leu 


Pro 


Met 


Arg 


Asn 


lie 


Pro 


Val Gly Ser Thr Val 


His Asn Val 
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130 






135 




Glu Met 


Lys 


Pro Gly Lys 


Gly 


Gly 


145 




150 






Tyr Val 


Gin 


He Val Ala 


Arg 


Asp 






165 






Arg Ser 


Gly Glu Met Arg 


Lys 


Val 






180 






Gly Glu 


Val 


Gly Asn Ala Glu 


His 




195 






200 


Gly Ala 


Ala 


Arg Trp Arg 


Gly 


Val 


210 






215 




Met Asn 


Pro 


Val Asp His 


Pro 


His 


225 




230 






Gly Lys 


His 


Pro Val Thr 


Pro 


Trp 






245 






Thr Arg 


Ser 


Asn Lys Arg 


Thr 


Asp 






260 






Lys 











140 

Gin Leu Ala Arg Ser Ala Gly Thr 
155 160 
Gly Ala Tyr Val Thr Leu Arg Leu 

170 175 
Glu Ala Asp Cys Arg Ala Thr Leu 
185 190 
Met Leu Arg Val Leu Gly Lys Ala 
205 

Arg Pro Thr Val Arg Gly Thr Ala 
220 

Gly Gly Gly Glu Gly Arg Asn Phe 
235 240 
Gly Val Gin Thr Lys Gly Lys Lys 

250 255 
Lys Phe He Val Arg Arg Arg Ser 
265 270 



<210> 390 
<211> 100 
<212> PRT 
<213> E. Coli 

<400> 390 



Met He Arg Glu 


Glu Arg 


Leu 


Leu Lys 


Val 


Leu Arg Ala 


Pro 


His 


Val 


1 


5 






10 






15 




Ser Glu Lys Ala 


Ser Thr 


Ala 


Met Glu 


Lys 


Ser Asn Thr 


He 


Val 


Leu 


20 






25 






30 






Lys Val Ala Lys 


Asp Ala 


Thr 


Lys Ala 


Glu 


He Lys Ala 


Ala 


Val 


Gin 


35 






40 




45 








Lys Leu Phe Glu 


Val Glu 


Val 


Glu Val 


Val 


Asn Thr Leu 


Val 


Val 


Lys 


50 




55 






60 






Gly Lys Val Lys 


Arg His 


Gly 


Gin Arg 


He 


Gly Arg Arg Ser Asp 


Trp 


65 


70 








75 






80 


Lys Lys Ala Tyr 


Val Thr 


Leu 


Lys Glu Gly 


Gin Asn Leu 


Asp 


Phe 


Val 




85 






90 




95 




Gly Gly Ala Glu 



















100 



<210> 391 

<211> 201 

<212> PRT 

<213> E. Coli 

<400> 391 
Met Glu Leu Val Leu Lys Asp Ala 

1 5 
Thr Thr Phe Gly Arg Asp Phe Asn 
20 

Val Ala Tyr Ala Ala Gly Ala Arg 

35 40 
Arg Ala Glu Val Thr Gly Ser Gly 

50 55 
Thr Gly Arg Ala Arg Ser Gly Ser 
65 70 



Gin Ser Ala Leu Thr Val Ser Glu 

10 15 
Glu Ala Leu Val His Gin Val Val 
25 30 
Gin Gly Thr Arg Ala Gin Lys Thr 
45 

Lys Lys Pro Trp Arg Gin Lys Gly 
60 

lie Lys Ser Pro lie Trp Arg Ser 
75 80 
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Gly Gly 


Val 


Thr 


Phe 


Ala Ala Arg 








85 




Asn Lys 


Lys 


Met 


Tyr Arg Gly Ala 






100 






Val Arg 


Gin 


Asp 


Arg 


Leu He Val 




115 






120 


Pro Lys 


Thr 


Lys 


Leu 


Leu Ala Gin 


130 








135 


Asp Val 


Leu 


He 


He Thr Gly Glu 


145 








150 


Ala Arg 


Asn 


Leu 


His 


Lys Val Asp 








165 




Pro Val 


Ser 


Leu 


He 


Ala Phe Asp 






180 




Val Lys 


Gin 


Val 


Glu 


Glu Met Leu 




195 






200 



Pro Gin Asp His Ser Gin Lys Val 

90 95 
Leu Lys Ser He Leu Ser Glu Leu 
105 no 
Val Glu Lys Phe Ser Val Glu Ala 
125 

Lys Leu Lys Asp Met Ala Leu Glu 
140 

Leu Asp Glu Asn Leu Phe Leu Ala 
155 160 
Val Arg Asp Ala Thr Gly He Asp 

170 175 
Lys Val Val Met Thr Ala Asp Ala 
185 190 
Ala 



<210> 392 
<211> 209 
<212> PRT 
<213> E. Coli 

<400> 392 

Met He Gly Leu Val Gly Lys Lys Val Gly Met Thr Arg He Phe Thr 

1 5 10 15 

Glu Asp Gly Val Ser He Pro Val Thr Val He Glu Val Glu Ala Asn 

20 25 30 

Arg Val Thr Gin Val Lys Asp Leu Ala Asn Asp Gly Tyr Arg Ala He 

35 40 45 

Gin Val Thr Thr Gly Ala Lys Lys Ala Asn Arg Val Thr Lys Pro Glu 

50 55 60 

Ala Gly His Phe Ala Lys Ala Gly Val Glu Ala Gly Arg Gly Leu Trp 
65 70 75 ~ 80 

Glu Phe Arg Leu Ala Glu Gly Glu Glu Phe Thr Val Gly Gin Ser He 

85 90 95 

Ser Val Glu Leu Phe Ala Asp Val Lys Lys Val Asp Val Thr Gly Thr 

100 105 no 

Ser Lys Gly Lys Gly Phe Ala Gly Thr Val Lys Arg Trp Asn Phe Arg 

115 120 125 

Thr Gin Asp Ala Thr His Gly Asn Ser Leu Ser His Arg Val Pro Gly 

130 135 140 

Ser He Gly Gin Asn Gin Thr Pro Gly Lys Val Phe Lys Gly Lys Lys 
145 150 155 " 160 

Met Ala Gly Gin Met Gly Asn Glu Arg Val Thr Val Gin Ser Leu Asp 

165 170 175 

Val Val Arg Val Asp Ala Glu Arg Asn Leu Leu Leu Val Lys Gly Ala 

180 185 190 

Val Pro Gly Ala Thr Gly Ser Asp Leu He Val Lys Pro Ala Val Lys 
195 200 205 

Ala 



<210> 393 
<211> 103 
<212> PRT 
<213> E. Coli 
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<400> 393 

Met Gin Asn Gin Arg He Arg He Arg Leu Lys Ala Phe Asp His Arg 

15 10 15 

Leu He Asp Gin Ala Thr Ala Glu He Val Glu Thr Ala Lys Arg Thr 

20 25 30 

Gly Ala Gin Val Arg Gly Pro He Pro Leu Pro Thr Arg Lys Glu Arg 

35 4 0 45 

Phe Thr Val Leu He Ser Pro His Val Asn Lys Asp Ala Arg Asp Gin 

50 55 60 

Tyr Glu He Arg Thr His Leu Arg Leu Val Asp He Val Glu Pro Thr 
65 70 75 80 

Glu Lys Thr Val Asp Ala Leu Met Arg Leu Asp Leu Ala Ala Gly Val 

85 90 95 

Asp Val Gin He Ser Leu Gly 
100 



<210> 394 
<211> 118 
<212> PRT 
<213> E. Coli 

<400> 394 



Met Ala Arg Val 


Lys Arg 


Gly Val 


He 


Ala 


Arg Ala Arg His 


Lys 


Lys 


1 


5 






10 




15 


He Leu Lys Gin 


Ala Lys 


Gly Tyr 


Tyr 


Gly 


Ala Arg Ser Arg 


Val 


Tyr 


20 






25 




30 




Arg Val Ala Phe 


Gin Ala 


Val He 


Lys 


Ala 


Gly Gin Tyr Ala Tyr Arg 


35 




40 






45 






Asp Arg Arg Gin 


Arg Lys 


Arg Gin 


Phe 


Arg 


Gin Leu Trp He 


Ala 


Arg 


50 




55 






60 




He Asn Ala Ala 


Ala Arg 


Gin Asn Gly 


He 


Ser Tyr Ser Lys 


Phe 


He 


65 


70 








75 




80 


Asn Gly Leu Lys 


Lys Ala 


Ser Val 


Glu 


He 


Asp Arg Lys He 


Leu 


Ala 




85 






90 




95 




Asp He Ala Val 


Phe Asp 


Lys Val 


Ala 


Phe 


Thr Ala Leu Val 


Glu 


Lys 


100 






105 




110 




Ala Lys Ala Ala 


Leu Ala 















115 



<210> 395 
<211> 65 
<212> PRT 
<213> E. Coli 

<400> 395 

Met Pro Lys He Lys Thr Val Arg Gly Ala Ala Lys Arg Phe Lys Lys 

1 5 10 15 

Thr Gly Lys Gly Gly Phe Lys His Lys His Ala Asn Leu Arg His He 

20 25 30 

Leu Thr Lys Lys Ala Thr Lys Arg Lys Arg His Leu Arg Pro Lys Ala 

35 40 45 

Met Val Ser Lys Gly Asp Leu Gly Leu Val He Ala Cys Leu Pro Tyr 
50 55 60 

Ala 
65 
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<210> 396 
<211> 180 
<212> PRT 
<213> E. Coli 



<400> 


396 














Met Lys 


Gly Gly 


Lys Arg Val 


Gin Thr Ala Arg 


Pro 


Asn Arg 


He 


Asn 


1 






5 


10 




15 




Gly Glu 


lie Arg 


Ala Gin Glu 


Val Arg Leu Thr 


Gly 


Leu Glu 


Gly Glu 






20 




25 




30 






Gin Leu 


Gly 
35 


He 


Val Ser Leu 


Arg Glu Ala Leu 
40 


Glu 


Lys Ala 
45 


Glu 


Glu 


Ala Gly 


Val Asp 


Leu Val Glu 


He Ser Pro Asn 


Ala 


Glu Pro 


Pro 


Val 


50 






55 




60 








Cys Arg 


He 


Met 


Asp Tyr Gly 


Lys Phe Leu Tyr 


Glu 


Lys Ser 


Lys 


Ser 


65 






70 


75 




80 


Ser Lys 


Glu 


Gin 


Lys Lys Lys Gin Lys Val He 


Gin 


Val Lys 


Glu 


He 








85 


90 




95 




Lys Phe 


Arg 


Pro 


Gly Thr Asp 


Glu Gly Asp Tyr Gin Val Lys Leu Arg 






100 




105 




110 






Ser Leu 


He 


Arg 


Phe Leu Glu 


Glu Gly Asp Lys Ala Lys He 


Thr 


Leu 




115 






120 




125 






Arg Phe 


Arg 


Gly 


Arg Glu Met 


Ala His Gin Gin 


He 


Gly Met 


Glu 


Val 


130 






135 




140 






Leu Asn 


Arg 


Val 


Lys Asp Asp 


Leu Gin Glu Leu 


Ala 


Val Val 


Glu 


Ser 


145 






150 


155 








160 


Phe Pro 


Thr 


Lys 


He Glu Gly Arg Gin Met He 


Met 


Val Leu 


Ala 


Pro 








165 


170 






175 




Lys Lys 


Lys 


Gin 
180 















<210> 397 
<211> 642 
<212> PRT 
<213> E. Coli 

<400> 397 

Met Pro Val He Thr Leu Pro Asp Gly Ser Gin Arg His Tyr Asp His 

15 10 15 

Ala Val Ser Pro Met Asp Val Ala Leu Asp lie Gly Pro Gly Leu Ala 

20 25 30 

Lys Ala Cys He Ala Gly Arg Val Asn Gly Glu Leu Val Asp Ala Cys 

35 40 45 

Asp Leu He Glu Asn Asp Ala Gin Leu Ser He He Thr Ala Lys Asp 

50 55 60 

Glu Glu Gly Leu Glu lie lie Arg His Ser Cys Ala His Leu Leu Gly 
65 70 75 80 

His Ala lie Lys Gin Leu Trp Pro His Thr Lys Met Ala lie Gly Pro 

85 90 95 

Val lie Asp Asn Gly Phe Tyr Tyr Asp Val Asp Leu Asp Arg Thr Leu 

100 105 HO 

Thr Gin Glu Asp Val Glu Ala Leu Glu Lys Arg Met His Glu Leu Ala 

115 120 125 

Glu Lys Asn Tyr Asp Val lie Lys Lys Lys Val Ser Trp His Glu Ala 

130 135 140 

Arg Glu Thr Phe Ala Asn Arg Gly Glu Ser Tyr Lys Val Ser lie Leu 
145 150 155 ** 160 

Asp Glu Asn lie Ala His Asp Asp Lys Pro Gly Leu Tyr Phe His Glu 
165 170 ' 175 
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Glu Tyr Val Asp Met Cys Arg Gly Pro His Val Pro Asn Met Arg Phe 

180 185 190 

Cys His His Phe Lys Leu Met Lys Thr Ala Gly Ala Tyr Trp Arg Gly 

195 200 205 

Asp Ser Asn Asn Lys Met Leu Gin Arg He Tyr Gly Thr Ala Trp Ala 

210 215 220 

Asp Lys Lys Ala Leu Asn Ala Tyr Leu Gin Arg Leu Glu Glu Ala Ala 
225 230 235 240 

Lys Arg Asp His Arg Lys He Gly Lys Gin Leu Asp Leu Tyr His Met 

245 250 255 

Gin Glu Glu Ala Pro Gly Met Val Phe Trp His Asn Asp Gly Trp Thr 

260 265 270 

He Phe Arg Glu Leu Glu Val Phe Val Arg Ser Lys Leu Lys Glu Tyr 

275 280 " 285 

Gin Tyr Gin Glu Val Lys Gly Pro Phe Met Met Asp Arg Val Leu Trp 

290 295 300 

Glu Lys Thr Gly His Trp Asp Asn Tyr Lys Asp Ala Met Phe Thr Thr 
305 310 315 320 

Ser Ser Glu Asn Arg Glu Tyr Cys He Lys Pro Met Asn Cys Pro Gly 

325 330 335 

His Val Gin He Phe Asn Gin Gly Leu Lys Ser Tyr Arg Asp Leu Pro 

340 345 350 

Leu Arg Met Ala Glu Phe Gly Ser Cys His Arg Asn Glu Pro Ser Gly 

355 360 365 

Ser Leu His Gly Leu Met Arg Val Arg Gly Phe Thr Gin Asp Asp Ala 

370 375 380 

His He Phe Cys Thr Glu Glu Gin He Arg Asp Glu Val Asn Gly Cys 
385 390 395 400 

He Arg Leu Val Tyr Asp Met Tyr Ser Thr Phe Gly Phe Glu Lys He 

405 410 415 

Val Val Lys Leu Ser Thr Arg Pro Glu Lys Arg He Gly Ser Asp Glu 

420 425 430 

Met Trp Asp Arg Ala Glu Ala Asp Leu Ala Val Ala Leu Glu Glu Asn 

435 440 445 

Asn He Pro Phe Glu Tyr Gin Leu Gly Glu Gly Ala Phe Tyr Gly Pro 

450 455 460 

Lys He Glu Phe Thr Leu Tyr Asp Cys Leu Asp Arg Ala Trp Gin Cys 
465 470 475 ~ 480 

Gly Thr Val Gin Leu Asp Phe Ser Leu Pro Ser Arg Leu Ser Ala Ser 

485 490 495 

Tyr Val Gly Glu Asp Asn Glu Arg Lys Val Pro Val Met He His Arg 

500 505 510 

Ala He Leu Gly Ser Met Glu Arg Phe He Gly He Leu Thr Glu Glu 

515 520 525 

Phe Ala Gly Phe Phe Pro Thr Trp Leu Ala Pro Val Gin Val Val He 

530 535 540 

Met Asn He Thr Asp Ser Gin Ser Glu Tyr Val Asn Glu Leu Thr Gin 
545 550 555 560 

Lys Leu Ser Asn Ala Gly He Arg Val Lys Ala Asp Leu Arg Asn Glu 

565 570 575 

Lys He Gly Phe Lys He Arg Glu His Thr Leu Arg Arg Val Pro Tyr 

580 585 590 

Met Leu Val Cys Gly Asp Lys Glu Val Glu Ser Gly Lys Val Ala Val 

595 .600 605 

Arg Thr Arg Arg Gly Lys Asp Leu Gly Ser Met Asp Val Asn Glu Val 

610 615 620 

He Glu Lys Leu Gin Gin Glu He Arg Ser Arg Ser Leu Lys Gin Leu 
625 630 635 640 

Glu Glu 
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<210> 398 
<211> 450 
<212> PRT 
<213> E. Coli 

<400> 398 

Met Thr Lys His Tyr Asp Tyr lie Ala He Gly Gly Gly Ser Gly Gly 

1 5 10 15 

He Ala Ser He Asn Arg Ala Ala Met Tyr Gly Gin Lys Cys Ala Leu 

20 25 30 

He Glu Ala Lys Glu Leu Gly Gly Thr Cys Val Asn Val Gly Cys Val 

35 40 4 5 

Pro Lys Lys Val Met Trp His Ala Ala Gin He Arg Glu Ala He His 

50 55 60 

Met Tyr Gly Pro Asp Tyr Gly Phe Asp Thr Thr He Asn Lys Phe Asn 
65 70 75 80 

Trp Glu Thr Leu He Ala Ser Arg Thr Ala Tyr He Asp Arg He His 

85 90 95 

Thr Ser Tyr Glu Asn Val Leu Gly Lys Asn Asn Val Asp Val He Lys 

100 105 no 

Gly Phe Ala Arg Phe Val Asp Ala Lys Thr Leu Glu Val Asn Gly Glu 

115 120 125 

Thr lie Thr Ala Asp His lie Leu lie Ala Thr Gly Gly. Arg Pro Ser 

130 135 140 

His Pro Asp lie Pro Gly Val Glu Tyr Gly lie Asp Ser Asp Gly Phe 
145 150 155 160 

Phe Ala Leu Pro Ala Leu Pro Glu Arg Val Ala Val Val Gly Ala Gly 

165 170 175 

Tyr lie Ala Val Glu Leu Ala Gly Val lie Asn Gly Leu Gly Ala Lys 

180 185 190 

Thr His Leu Phe Val Arg Lys His Ala Pro Leu Arg Ser Phe Asp Pro 

195 200 205 

Met lie Ser Glu Thr Leu Val Glu Val Met Asn Ala Glu Gly Pro Gin 

210 215 220 

Leu His Thr Asn Ala lie Pro Lys Ala Val Val Lys Asn Thr Asp Gly 
225 230 235 240 

Ser Leu Thr Leu Glu Leu Glu Asp Gly Arg Ser Glu Thr Val Asp Cys 

245 250 255 

Leu lie Trp Ala He Gly Arg Glu Pro Ala Asn Asp Asn lie Asn Leu 

260 265 270 

Glu Ala Ala Gly Val Lys Thr Asn Glu Lys Gly Tyr lie Val Val Asp 

275 280 " 285 

Lys Tyr Gin Asn Thr Asn lie Glu Gly lie Tyr Ala Val Gly Asp Asn 

290 295 300 

Thr Gly Ala Val Glu Leu Thr Pro Val Ala Val Ala Ala Gly Arg Arg 
305 310 315 320 

Leu Ser Glu Arg Leu Phe Asn Asn Lys Pro Asp Glu His Leu Asp Tyr 

325 330 335 

Ser Asn lie Pro Thr Val Val Phe Ser His Pro Pro He Gly Thr Val 

340 345 350 

Gly Leu Thr Glu Pro Gin Ala Arg Glu Gin Tyr Gly Asp Asp Gin Val 

355 360 365 

Lys Val Tyr Lys Ser Ser Phe Thr Ala Met Tyr Thr Ala Val Thr Thr 

370 375 380 

His Arg Gin Pro Cys Arg Met Lys Leu Val Cys Val Gly Ser Glu Glu 
385 390 395 400 

Lys lie Val Gly lie His Gly lie Gly Phe Gly Met Asp Glu Met Leu 

405 410 415 

Gin Gly Phe Ala Val Ala Leu Lys Met Gly Ala Thr Lys Lys Asp Phe 
420 425 430 
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Asp Asn Thr 
435 

Met Arg 
450 



Val Ala lie His Pro Thr Ala Ala Glu Glu Phe Val Thr 
440 445 



<210> 399 
<211> 2894 
<212> RNA 
<213> E. Coli 

<400> 399 

aagguuaagc cucacgguuc auuaguaccg guuagcucaa cgcaucgcug cgcuuacaca 60 

cccggccuau caacgucguc gucuucaacg uuccuucagg acccuuaaag ggucagggag 120 

aacucaucuc ggggcaaguu ucgugcuuag augcuuucag cacuuaucuc uuccgcauuu 180 

agcuaccggg cagugccauu ggcaugacaa cccgaacacc agugaugcgu ccacuccggu 240 

ccucucguac uaggagcagc cccccucagu ucuccagcgc ccacggcaga uagggaccga 300 

acugucucac gacguucuaa acccagcucg cguaccacuu uaaauggcga acagccauac 360 

ccuugggacc uacuucagcc ccaggaugug augagccgac aucgaggugc caaacaccgc 420 

cgucgauaug aacucuuggg cgguaucagc cuguuauccc cggaguaccu uuuauccguu 480 

gagcgauggc ccuuccauuc agaaccaccg gaucacuaug accugcuuuc gcaccugcuc 540 

gcgccgucac gcucgcaguc aagcuggcuu augccauugc acuaaccucc ugauguccga 600 

ccaggauuag ccaaccuucg ugcuccuccg uuacucuuua ggaggagacc gccccaguca 660 

aacuacccac cagacacugu ccgcaacccg gauuacgggu caacguuaga acaucaaaca 720 

uuaaagggug guauuucaag gucggcucca ugcagacugg cguccacacu ucaaagccuc 780 

ccaccuaucc uacacaucaa ggcucaaugu ucagugucaa gcuauaguaa agguucacgg 840 

ggucuuuccg ucuugccgcg gguacacugc aucuucacag cgaguucaau uucacugagu 900 

cucgggugga gacagccugg ccaucauuac gccauucgug caggucggaa cuuacccgac 960 

aaggaauuuc gcuaccuuag gaccguuaua guuacggccg ccguuuaccg gggcuucgau 1020 

caagagcuuc gcuugcgcua accccaucaa uuaaccuucc ggcaccgggc aggcgucaca 1080 

ccguauacgu ccacuuucgu guuugcacag ugcuguguuu uuaauaaaca guugcagcca 1140 

gcugguaucu ucgacugauu ucagcuccau ccgcgaggga ccucaccuac auaucagcgu 1200 

gccuucuccc gaaguuacgg caccauuuug ccuaguuccu ucacccgagu ucucucaagc 1260 

gccuugguau ucucuaccug accaccugug ucgguuuggg guacgauuug auguuaccug 1320 

augcuuagag gcuuuuccug gaagcagggc auuuguugcu ucagcaccgu agugccucgu 1380 

caucacgccu cagccuugau uuuccggauu ugccuggaaa accagccuac acgcuuaaac 1440 

cgggacaacc gucgcccggc caacauagcc uucuccgucc ccccuucgca guaacaccaa 1500 

guacaggaau auuaaccugu uucccaucga cuacgccuuu cggccucgcc uuaggggucg 1560 

acucacccug ccccgauuaa cguuggacag gaacccuugg ucuuccggcg agcgggcuuu 1620 

ucacccgcuu uaucguuacu uaugucagca uucgcacuuc ugauaccucc agcaugccuc 1680 

acagcacacc uucgcaggcu uacagaacgc uccccuaccc aacaacgcau aagcgucgcu 174 0 

gccgcagcuu cggugcaugg uuuagccccg uuacaucuuc cgcgcaggcc gacucgacca 1800 

gugagcuauu acgcuuucuu uaaaugaugg cugcuucuaa gccaacaucc uggcugucug 1860 

ggccuuccca caucguuucc cacuuaacca ugacuuuggg accuuagcug gcggucuggg 1920 

uuguuucccu cuucacgacg gacguuagca cccgccgugu gucucccgug auaacauucu 1980 

ccgguauucg caguuugcau cggguuggua agucgggaug acccccuugc cgaaacagug 2040 

cucuaccccc ggagaugaau ucacgaggcg cuaccuaaau agcuuucggg gagaaccagc 2100 

uaucucccgg uuugauuggc cuuucacccc cagccacaag ucauccgcua auuuuucaac 2160 

auuagucggu ucgguccucc aguuaguguu acccaaccuu caaccugccc auggcuagau 2220 

caccggguuu cgggucuaua cccugcaacu uaacgcccag uuaagacucg guuucccuuc 2280 

ggcuccccua uucgguuaac cuugcuacag aauauaaguc gcugacccau uauacaaaag 2340 

guacgcaguc acacgccuaa gcgugcuccc acugcuugua cguacacggu uucagguucu 2400 

uuuucacucc ccucgccggg guucuuuucg ccuuucccuc acgguacugg uucacuaucg 24 60 

gucagucagg aguauuuagc cuuggaggau ggucccccca uauucagaca ggauaccacg 2520 

ugucccgccc uacucaucga gcucacagca ugugcauuuu uguguacggg gcugucaccc 2580 

uguaucgcgc gccuuuccag acgcuuccac uaacacacac acugauucag gcucugggcu 2640 

gcuccccguu cgcucgccgc uacuggggga aucucgguug auuucuuuuc cucgggguac 2700 

uuagauguuu caguuccccc gguucgccuc auuaaccuau ggauucaguu aaugauagug 2760 

ugucgaaaca cacuggguuu ccccauucgg aaaucgccgg uuauaacggu ucauaucacc 2820 

uuaccgacgc uuaucgcaga uuagcacguc cuucaucgcc ucugacugcc agggcaucca 2880 
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ccguguacgc uuagucgcuu aacc 



2894 



<210> 400 
<211> 120 
<212> RNA 
<213> E. Coli 

<400> 400 

augccuggca guucccuacu cucgcauggg gagaccccac acuaccaucg gcgcuacggc 60 
guuucacuuc ugaguucggc auggggucag gugggaccac cgcgcuacgg ccgccaggca 120 



<210> 401 
<211> 76 
<212> RNA 
<213> E. Coli 

<400> 401 

guccccuucg ucuagaggcc caggacaccg cccuuucacg gcgguaacag ggguucgaau 
ccccuagggg acgcca 



60 
76 



<210> 402 
<211> 1549 
<212> RNA 
<213> E. Coli 



<400> 402 

aaauugaaga guuugaucau ggcucagauu gaacgcuggc ggcaggccua 
gucgaacggu aacaggaagc agcuugcugc uucgcugacg aguggcggac 
ugucugggaa gcugccugau ggagggggau aacuacugga aacgguagcu 
aaugucgcaa gaccaaagag ggggaccuuc gggccucuug ccaucggaug 
ggauuagcuu guuggugggg uaacggcuca ccaaggcgac gaucccuagc 
ggaugaccag ccacacugga acugagacac gguccagacu ccuacgggag 
ggaauauugc acaaugggcg caagccugau gcagccaugc cgcguguaug 
ucggguugua aaguacuuuc agcggggagg aagggaguaa aguuaauacc 
gacguuaccc gcagaagaag caccggcuaa cuccgugcca gcagccgcgg 
ggugcaagcg uuaaucggaa uuacugggcg uaaagcgcac gcaggcgggu 
agaugugaaa uccccgggcu caaccuggga acugcaucug auacuggcaa 
cguagagggg gguagaauuc cagguguagc ggugaaaugc guagagaucu 
cgguggcgaa ggcggccccc uggacgaaga cugacgcuca ggugcgaaag 
aaacaggauu agauacccug guaguccacg ccguaaacga ugucgacuug 
ccuugaggcg uggcuuccgg agcuaacgcg uuaagucgac cgccugggga 
aagguuaaaa cucaaaugaa uugacggggg cccgcacaag cgguggagca 
uucgaugcaa cgcgaagaac cuuaccuggu cuugacaucc acggaaguuu 
gaaugugccu ucgggaaccg ugagacaggu gcugcauggc ugucgucagc 
aaauguuggg uuaagucccg caacgagcgc aacccuuauc cuuuguugcc 
ccgggaacuc aaaggagacu gccagugaua aacuggagga agguggggau 
caucauggcc cuuacgacca gggcuacaca cgugcuacaa uggcgcauac 
gaccucgcga gagcaagcgg accucauaaa gugcgucgua guccggauug 
cucgacucca ugaagucgga aucgcuagua aucguggauc agaaugccac 
uucccgggcc uuguacacac cgcccgucac accaugggag uggguugcaa 
uagcuuaacc uucgggaggg cgcuuaccac uuugugauuc augacugggg 
acaagguaac cguaggggaa ccugcgguug gaucaccucc uuaccuuaa 



acacaugcaa 
gggugaguaa 
aauaccgcau 
ugcccagaug 
uggucugaga 
gcagcagugg 
aagaaggccu 
uuugcucauu 
uaauacggag 
ugguuaaguc 
gcuugagucu 
ggaggaauac 
cguggggagc 
gagguugugc 
guacggccgc 
ugugguuuaa 
ucagagauga 
ucguguugug 
agcgguccgg 
gacgucaagu 
aaagagaagc 
gagucugcaa 
ggugaauacg 
aagaaguagg 
ugaagucgua 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1549 
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<210> 403 
<211> 17 
<212> DNA 
<213> Artificial 
<220> 

<223> Primer Oligonucleotide 



<400> 403 
tgtttatcag accgctt 

<210> 404 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer Oligonucleotide 

<400> 404 
acaatttcac acagcctc 



<210> 405 
<211> 159 
<212> DNA 

<213> Escherichia coli 



<400> 405 

caggtggtat ggaaacccaa aatggagacg ggaagctgaa ccagatagtt actggaggtg 60 

atcaccagca gatgaaataa cgataaccag aacaacgcct tatagcgttg agtttgcgag 120 

aaaacgttca tattgtacct ttttgattaa ccattgggg ~ " ~ 159 

<210> 406 
<211> 640 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (i) . . . (640) 
<223> n = A,T,C or G 



<400> 406 

ggggnccaaa gtgtttgggn cgggcaactg gaggccaacc ttaanttngg ggaaattttt 60 

aanaaaaggc ggggatttgt nagccacggg ngattanttt anaataaatt aagtgttgcc 120 

ataaggggac aaagngaagg aagtggntat taanggannc gccaatgcga nttagggcag 180 

accattcggc cattcgcctt cttggttatc gaagttcatc cagatagccg ttgccngacc 240 

gaccagattc gcttcnggca caaagcccca gtaacggctg tccgcgctgt tgtcgcggtt 300 

gtcgccca-c atgaagtatt gtcccggagg aacaatccag gttgccagtt gttgccctgg 360 

ctgctggtaa tacatcccca cctgatcctg cgcaatcggc actgtcagaa tgcggtgcgt 420 

cacatcaccc agtgtctctt tacgctcgga aagacgaatt ccattttctt tggtttcgtt 480 

tttcggcacn tcaaagaatc cgctggtcgc ttccccacca ttacggcgtg agaaggtctg 540 

aacgaaatcg ctcggttcca cgtttgagta ggtgaccggc agcgcgtttt cacacgcctg 600 

gccggaac-g catcccggtt gaatcgtcag ctcttttgag 64 0 

<210> 4 07 
<211> 682 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc feature 
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<222> (1) . . . (682) 
<223> n = A, T, C or G 

<400> 407 

cctgcagggt aatgtcgcca ttaaactggc gcaggcagcc aaagagttgc tccgcttcta 60 

cccagtcggc agcgacaact tgcgttaaag tcgcaaaatt atcatctgca ctcactgcgt 120 

gacgtaagcg gatggagtgg ccggaaacct catagtgacc gcccaccagt tggcctgcat 180 

cgctttgtag cgtacgcgcg gcattggcaa taagattcag atactcagac tcttccgggg 240 

ccttcgccag cataaaagag gaggatgctc gcgtatgcag caactgctcc agcgcaaatt 300 

gcagccgcgg ttgagtatca ctgaataaag gatcgttttc gtcaatcaaa tgtggctgag 360 

caaatatttc ctgatagcta tcggtatcag gaaccaggtc acgccatgca agtttcgtaa 420 

tggtcaaagt tgatgttttt tagtctgttg tcaaagccgc nattataccn gtaaccggca 480 

ctacagcaca cgtagaaagc acccgacaat actcctggca tgggcgttaa agctcacagg 540 

atggagatct tttcttcact ggcctaaaaa gctgatattc tgtaaagagt tacacngtaa 600 
cattgagatc gctatgaaat atcaacaact tggaaaatct tgnaaagcng gttggaaaat 
ggaaagtatc tggttaagaa gc 



660 
682 



<210> 408 
<211> 309 
<212> DNA 

<213> Escherichia coli 
<400> 408 

ggggatccgg cagaatttta cgctgaccaa tgacgcgacg acgtggcatg gaaatactcc 60 
gttgttaatt caggattgtc caaaactcta cgagtttagt ttgacattta agttaaaacg 120 
tttggcctta cttaacggag aaccattaag ccttaggacg cttcacgcca tacttggaac 180 
gagcctgctt acggtcttta acgccggagc agtcaagcgc accacgtacg gtgtggtaac 240 
gaacacccgg gaggtcttta acacgaccgc cacggatcag gatcacggag tgctcctgca 300 
gccaagctt " 

<210> 409 
<211> 1167 
<212> DNA 

<213> Escherichia coli 
<400> 409 

gtcgacccat ctgtccattg agcggacagt ttgtgcaaca ctattttgtt gaccggaaaa 60 

tggaacactt tccgcaatgc ctgttgctat cacgcttaaa ccatttcatt gcgatttaca 120 

cagaacggac gtcctgtcgc agtatattaa gtcgtcgata gaaacaagca ttgaaaggca 180 

cagcagtagt caaacagtgt gaaacgctac tggcgcctta cagcgcaaaa aggctggtga 240 

ctaaaaagtc accagccatc agcctgattt ctcaggctgc aaccggaagg gttggcttat 300 

ttaacttcaa cttcagcgcc agcttcttcc agagcttttt tcagtgcttc tgcgtcgtct 360 

ttgctcacgc cttctttcag agcagccggt gcagattcta ccaggtcttt agcttctttc 420 

agacccaggc cagttgcgcc acgtactgct ttgataacag caactttgtt agcgccagca 480 

gctttcagaa ttacgtcgaa ttcagttttt tcttcagcag cttcaaccgg gccagcagct 540 

acagctacag cagcagcagc ggaaacaccg aatttttctt ycattgcaga gatcaagttc 600 

tacaacgtcc attacagaca tagctgcaac tgcttcaatg awttgatctt tagtgataga 660 

catttaaatk gttcctgaat atcagaataa gtttatacgt aagcgaatgc gttaaaaaga 720 

taactgcgaw taagcagctt ytttcgcatc gcgtacagma gccagagtac gaaccagttt 780 

gccagccgaa gcttctttca tggttgccat caggcgtgca attgcttctt cgtaggtcgg 840 

cagagttgcc aggcggtcga tctgagacgc cgggatcagc tcaccttcaa aggcagcggc 900 

tttgacctca aattttgcat tcgctttcgc gaactctttg aacagacgag cagcagcgcc 960 

cgggtgttcc atagagtatg caatcagggt cggaccaaca aacgcgtctt tcaggcactc 1020 

gaacggagta ccttcaacag cacggcgcag cagggtgtta cgaacaacac gcatgtatac 1080 

gccagcttcg cgaccrgctt tacgcagttc agtcatttta tctacagtta cgcccacggg 1140 

aatccgcaac tactgcaagc caagctt " 1167 

<210> 410 
<211> 404 
<212> DNA 

<213> Escherichia coli 
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<400> 410 

caaanctatt ttgktggacc ggaaaakgga acactttccg cawkgcctgt tgctatcacg 60 

cttaaaccat ttcattgcga tttacacaga acggacgtcc tgtcgcagta tattaagtcg 120 

tcgatagaaa caagcattga aaggcacagc agtagtcaaa cagtgtgaaa cgctactggc 180 

gccttacagc gcaaaaaggc tggtgactaa aaagtcacca gccatcagcc tgatttctca 240 

ggctgcaacc ggaagggttg gcttatttaa cttcaacttc agcgccagct tcttccagag 300 

cttttttcag tgcttctgcg tcgtctttgc tcacgccttc tttcagagca gccggtgcag 360 

attctaccag gtctttagct tctttcagac ccaggccagt tgcg 404 

<210> 411 
<211> 152 
<212> DNA 

<213> Escherichia coli 
<400> 411 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcaa gagcagcccg 60 
gtgcagattc taccaggtct ttagcttctt tcagacccag gccagttgcg ccacgtactg 120 
ctttgataac agcaactttg ttagcgccag ca a 152 

<210> 412 
<211> 825 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (1) . . . (825) 
<223> n = A,T,C or G 

<400> 412 

gatccgtcga cccatctgtc cattgagcgg acagtttgtg caacactatt ttgttgaccg 



gaaaatggaa cactttccgc aatgcctgtt gctatcacgc ttaamccatt tcattgcgat 
ttacacagaa cggacgtcct gtcgcagtat attaagtcgt cgatagaaac aagcattgaa 



60 
120 

- - zj ~3 *- -^j-i-u^uuu^ uay^aLLyaa 180 

aggcacagca gtagtcaaac agtgtgaaac gctactggcg ccttacagcg caaaaaggct 240 

ggtgactaaa aagtcaccag ccatcagcct gatttctcag gctgcaaccg gaagggttgg 300 

cttatttaac ttcaacttca gcgccagctt cttccagagc ttttttcagt gcttctgcgt 360 

cgtctttgct cacgccttct ttcagagcag ccgggtgcag attctaccag gtctttagct 420 

tctttcagac ccaggccagt tgcgccacgt actgctttga taacagcaac tttgttagcg 480 

ccagcagctt tcagaattac gtcgaattca agttttttct tcagcagctt caaccgggcc 540 

agcagctaca gctacagcag cagcagcgga aacaccgaat ttttcttyca ttggcagaga 600 

tcaagttcta caacgtccat tacagacata gctgcaactg cttcaatgat tkgatcttwa 660 

gtgatagaca tttaaattgt tcctgaatat cagaataagt ttatacgtaa gcgaatgcgt 720 

taaaaagata actgcgatta agcagcttct ttcgcatcgc gtacagcagc cagaggtcga 780 

accagtttgc cagccgaagg ttggcttttc agcctnnncn natta " " 825 

<210> 413 
<211> 425 
<212> DNA 

<213> Escherichia coli 
<400> 413 

agtagtcaaa caggtgkgra acgctactgg cgccttacag cgcaaaaagg ctggtgacta 60 

aaaagtcacc agccatcarc ctgatttctc aggctgcaac ccggaagggt tggcttattt 120 

aacttcaact tcagcgccag cttcttccag agcttttttc agtgcttctg cgtcgtcttt 180 

gctcacgcct tctttcagag cagccggtgc agattctacc aggtctttag cttctttcag 240 

acccaggcca gttgcgccac gtactgcttt gataacagca actttgttag cgccagcagc 300 

tttcagaatt acgtcgaatt cagttttttc ttcagcagct tcaaccgggc cagcagctac 360 

agctacagca gcagcagcgg aaacacccga atttttcttc cattgcagag atcaagttct 420 

acaac ' 425 
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<210> 414 
<211> 126 
<212> DNA 

<213> Escherichia coli 
<400> 414 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcag agcagccggt 60 
gcagattcta ccaggtcttt agcttctttc agacccaggc cagttgcgcc acgtactgct 120 
ttrata ' 126 

<210> 415 
<211> 264 
<212> DNA 

<213> Escherichia coli 
<400> 415 

ctgcmacccg gargggttgg cttatttaac ttcaacttca gcgccagctt cttycagagc 60 
ttttttcaag tgcttctgcg tcgtctttgc tcacgccttc tttcagagca gccggtgcag 120 
attctaccag gtctttagct tctttcagac ccaggccagt tgcgccacgt actgctttga 180 
taacagcaac tttgttagcg ccagcagctt tcagaattac gtcgaattca gttttttctt 240 
cagcagcttc aaccgggcca gcag 264 

<210> 416 
<211> 201 
<212> DNA 

<213> Escherichia coli 
<400> 416 

cgcataccct gcagcatcgg cccgatggag atcaggtcgg cagaacgctg taccgctttg 60 
taggtggtgt taccggtgtt cagatccggg aagatgaaca cggtagcgcg acctgcaacc 120 
ggagagttcg gcgctttgga tttcgcaacg tcagccatta ccgcagcgtc gtactgcagc 180 
ggaccgtcga tcatcaggtc a " ' ~ 2 01 

<210> 417 
<211> 239 
<212> DNA 

<213> Escherichia coli 
<400> 417 

aattcagcag ttgacagtgg cataaacgta actggtgact tttgcccggc atgacgccgg 60 
gcttttttta ttattccgtg acttccagcg tagtgaaggc aaacttctcg ccatcaaata 120 
gcccctgact ggttagtttt agcgcgggga tcactggcag agaaagaaac gccatctgaa 180 
taaacggctc atcgggtaac ggaccgcatt cacgggcggc ggctttcaag gcgtcaatt 239 

<210> 418 
<211> 223 
<212> DNA 

<213> Escherichia coli 
<400> 418 

ttcttttttt cgtcaacggt gtccagaatc attttattta cctcgggtac ttatgctgat 60 
ttttattatt atggggaagg tgttatttat gagtttcatt tatgccgtaa cgacaatgaa 120 
ctcgggaatt agtataagca gcgcgagaat aataatcatt gtgcaaatgc taatttaatt 180 
aatactattt aaatattatt ttgagcatat gcacataagg ttg 223 

<210> 419 
<211> 223 
<212> DNA 

<213> Escherichia coli 
<400> 419 
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ttctrttttt cgtcaacggt gtccagaatc attttattta cctcgggtac ttatgctgat 60 

ttttattatt atggggaagg tgttatttat gagtttcatt tatgccgtaa cgacaatgaa 120 

ctcgggaatt agtataagca gcgcgagaat aataatcatt gtgcaaatgc taatttaatt 180 

aatactattt aaatattatt ttgagcatat gcacataagg ttg 223 

<210> 420 
<211> 212 
<212> DNA 

<213> Escherichia coli 
<400> 420 

aatagcgggt atgcacgcct ttcttttttt cgtcaacggt gtccagaatc attttattta 60 
cctcgggtac ttatgctgat ttttattatt atggggaagg tgttatttat gagtttcatt 120 
tatgccgtaa cgmcaatgaa ctcgggaatt agtataagca gcgcgagaat aataatcatt 180 
gtgcaaatgc taatttaatt aatactattt aa 212 

<210> 421 
<211> 438 
<212> DNA 

<213> Escherichia coli 
<400> 421 

ccctctaaat tatcgcccgt ggcataaaaa ctgcgtccaa acgccgtctt tgccagcagc 60 

caggccataa atgccaccag aattatcgtc aaccaaccaa ttgctgaaac gccaagcagc 120 

agcggggcgg agagctgttt cagttcggcg ggtaaccctt caatccattt gccgccagtc 180 

cacagcaaca tgatgcctct gtacaaccct aacgtgccaa gggtggcaac aatggcaggg 240 

atctttagcc acgcgaccag gacaccgttg aaaaatcccg cgagcaaacc aagcagtaaa 300 

gtcgcgacac aagcaacagg tagtgaatat cctgcgttca gtaacatccc caacagcacc 360 

gcgcacattc cgggtaatcg aaccccactt gaaacatcaa tattgsgsgt aagcattwcc 420 

aagcgttcgs gcccatkg " " 438 

<210> 422 
<211> 682 
<212> DNA 

<213> Escherichia coli 
<400> 422 

aattcccggg gatccgtcga ccgtgcgctt ccggttgtgg caacccgcga aatggcgcgg 60 

cggtaagtat ggcggggtta ttccttcccc gttgaggaca ccgggttgtc aggttgacca 120 

tacgcttaag tgacaacccc gctgcaacgc cctctgttat caattttctg gtgacgtttg 180 

gcggtatcag ttttactccg tgactgctct gccgcccttt ttaaagtgaa ttttgtgatg 240 

tggtgaatgc ggctgagcgc acgcggaaca gttaaaacca aaaacagtgt tatgggtgga 300 

ttctctgtat ccggcgttaa ttgttaactg gttaacgtca cctggaggca ccaggcactg 360 

catcacaaaa ttcattgttg aggacgcgat aatgaaaacg ttattaccaa acgttaatac 420 

gtctgaaggt tgttttgaaa ttggtgtcac tatcagtaac ccagtattta ctgaagatgc 480 

cattaacaag agaaaacaag aacgggagct attaaataaa atatgcattg tttcaatgct 540 

ggctcgttta cgtctgatgc caaaaggatg tgcacaatga attcagcatt tgtgcttgtt 600 

ctgacagttt ttcttgtttc cggagagcca gttgatattg cagtcagtgt tcacaggaca 660 

atgcaggagt gatgactgca gc ~ 68 2 

<210> 423 
<211> 600 
<212> DNA 

<213> Escherichia coli 
<400> 423 

ggggatccga ttgtgactgc tctgccgccc tttttaaagt gaattttgtg atgtggtgaa 60 
tgcgcctgag cgcacgcgga acagttaaaa ccaaaaacag tgttatgggt ggattctctg 120 
tatccggcgt taattgttaa ctggttaacg tcacctggag gcaccaggca ctgcatcaca 180 
aaattcattg ttgaggacgc gataatgaaa acgttattac caaacgttaa tacgtctgaa 240 
ggttcttttg aaattggtgt cactatcagt aacccagtat ttactgaaga tgccattaac 300 
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aagagaaaac aagaacggga gctattaaat aaaatatgca ttgtttcaat gctggctcgt 360 

ttacgtctga tgccaaaagg atgtgcacaa tgaattcagc atttgtgctt gttctgacag 420 

tttttcttgt ttccggagag ccagttgata ttgcagtcag tgttcacagg acaatgcagg 480 

agtgtatgac tgcagcaacc gaacagaaaa ttcccggtaa ctgttacccg gtcgataaag 540 

ttattcacca ggataatatc gaaatcccgg caggtcttta aacagttccg taataaataa 600 



<210> 424 
<211> 100 
<212> DNA 

<213> Escherichia coli 



<400> 424 

gggatccagc aagaagatgc ggttgtaccg tcatcacgca gatgcgcaaa gctactcagc 
aactgacctt tcttcgcaat aagcacgcca ttagcgtcat 

<210> 425 
<211> 465 
<212> DNA 

<213> Escherichia coli 



<400> 425 

tcgcgtgttt 

gcggtttacc 

cgaggtgcag 

cacggaaaga 

cagctttggt 

ccagaatgat 

cgatagcagc 

gcatctgaac 



accttcaaca 
tacgttcgct 
ttcgcccata 
cgggtcttct 
tttcggttca 
cggcgcatcc 
agcgatgtcg 
gatacgaccg 



tcggtaactt 
tcaacgttga 
cccgcgatga 
ttagccagac 
actgcgatgg 
gggtcacaca 
cccgcgcgaa 
aaacgctcac 



tctggcggat 
attcacgctt 
tggtctggtt 
ggcccagagc 
agattaccgg 
gggtgtcacc 
cttctttgat 
gtgcagcttt 



agtttcacgg 
catacggtca 
agattcttcg 
cagacccatt 
ctcagggaat 
agtggttacg 
ctcttcacgt 
cacgg 



taagcaacct 
acgatgatgt 
tcagtccata 
ttttcctggt 
tccatacgtt 
tctttcagac 
ttgttagcgt 



<210> 426 
<211> 653 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
465 



<220> 

<221> misc_feature 
<222> (1) . . . (653) 
<223> n =» A,T,C or G 

<400> 426 

tgatcggctc aagcagaact ggtttcgctt tcttaaagcc ttctttaaag gcgatagaag 60 

cagccagttt aaacgccagt tcagaggagt caacgtcatg gtaagaaccg aagtgcagac 120 

gaatacccat gtctactacc gggtagcctg ccagcggacc tgctttcagc tgttcctgga 180 

tacctttatc aacggccggg atgtattcgc cagggattac accaccttta atgtcgttga 240 

tgaactcgta gcctttcggg tttgaacccg gctccagcgg gtacatgtcg ataacaacat 300 

gaccatactg accacgacca ccagactgtt tcgcgtgttt accttcaaca tcggtaactt 360 

tctggcggat agtttcacgg taagcaacct gcggtttacc tacgttcgct tcaacgttga 420 

attcacgctt catacggtca acgatgatgt cgaggtgcag ttcgccatac ccgcgatgat 480 

ggctgggtag attcttcgtc agtccataca cggnaagacg ggtcttnttt agccagacgg 540 

gccagagnca gacccatttt tttctggcag ctttggnttc ggtcaactgc gatggaaata 600 

cccggctcaa ggaattcata cgtttcanaa tgatcggggc attccgggtc aca " 653 

<210> 427 
<211> 268 
<212> DNA 

<213> Escherichia coli 



<400> 427 

ctttcttaaa gccttcttta aaggcgatag aagcagccag tttaaacgcc agttcagagg 60 
agtcaacgtc atggtaagaa ccgaagtgca gacgaatacc catgtctact accgggtagc 120 
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ctgccagcgg acctgctttc agctgttcct ggataccttt atcaacggcc gggatgtatt 180 
cgccagggat tacaccacct ttaatgtcgt tgatgaactc gtagcctttc gggtttgaac 240 
ccggctccag cgggtacatg tcgataac 268 

<210> 428 
<211> 330 
<212> DNA 

<213> Escherichia coli 



<400> 428 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 

atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc tgttttacag 120 

ctgatccttc tgttcttata acacaaggaa acgtacttaa ggtgcgtccg gtgaaccagt 180 

cggacgcacc tttaataact ataaataagt gtctgggcag atactatata aattaactta 240 

gtgaatgatt atgctaatgt catcaattaa ataaatataa tggcgttaag gcttcccagt 300 

aatataatta atactctact tccagagtag " 3 3 q 

<210> 429 
<211> 465 
<212> DNA 

<213> Escherichia coli 



<400> 429 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 

atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc tgttttacag 120 

ctgatccttc tgttcttata acacaaggaa acgtacttaa ggtgcgtccg gtgaaccagt 180 

cggacgcacc tttaataact ataaataagt gtctgggcag atactatata aattaactta 240 

gtgaatgatt atgctaatgt catcaattaa ataaatataa tggcgttaag gcttcccagt 300 

aatataatta atactctact tccagagtag aatattaaat tttatccgcg tggtgcatca 360 

gcacaaattt atcccacaac tgttcttctg tctcgacatg cgccggatct ttcacaatag 420 

tattggggat cgggcacacc ttctggcagg ttggtgtctc gtagt 4 65 

<210> 430 
<211> 379 
<212> DNA 

<213> Escherichia coli 
<400> 430 

aatctgaatg gctgcattcc ttgtttaagg aaaaacgaat gactgattgc cgatacctga 60 
ttaaacgggt catcaaaatc atcattgctg ttttacagct gatccttctg ttcttataac 120 
acaaggaaac gtacttaagg tgcgtccggt gaaccagtcg gacgcacctt taataactat 180 
aaataagtgt ctgggcagat actatataaa ttaacttagt gaatgattat gctaatgtca 240 
tcaattaaat aaatataatg gcgttaaggc ttcccagtaa tataattaat actctacttc 300 
cagagtagaa tattaaattt tatccgcgtg gtgcatcagc acaaatttat cccacaactg 360 
ttcttctgtc tcgacatgc 379 

<210> 431 
<211> 443 
<212> DNA 

<213> Escherichia coli 



<400> 431 

aagatgatgt gatgagaaag tcaatttgaa taagacaata ttaagagcta aaaaaatgtc 60 

aaaaaacact aaatcaaaaa ataatggcat tagaaaatat aatgcgaaaa cggaggtgaa 120 

attagtttat ttcaaatgag gaaaatctcc cggcgaaaaa accgggagat gaaagtgtga 180 

tgggtatcaa ataaacaaca gaggagaaat ttttaacgca gccattcagg caaatcgttt 240 

aatcccattg cctggcggat aagttgcggc ttaacgccag gaagcgtgtc ggccagtttc 300 

aaaccaatat cacgcagcag ttttttcgcc ggattggtac cggaaaacag atcgcggaat 360 

ccctgcatac cagccagcat caacgccgca ctgtgcttgc ggctacgctc atagcgacgc 420 

agataaatgt actgcccgat gtc " " 44 o 
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<210> 432 
<211> 638 
<212> DNA 

<213> Escherichia coli 



<400> 432 

cagggggttt gttgtgggca atgatgcatt taagttatcg tctgcagata gaggagatat 60 

tacaataaac aacgaatcag ggcatttgat agtcaatacc gcaattctat caggagatat 120 

agtcactcta agaggaggag aaattaggtt ggtattatag cttgtgcgcg ccatgattgg 180 

cgcgcaattt aaacttagtg ctttacatcg ctattgtctt gatttctttg aattatttta 240 

taaattaaaa aaacgactgt tatgtataag caaaggtccg aacgaaaaat acattccaaa 300 

taaatgcttg cttaaatctc tatatccttc cccgaaaaat gacacataaa attgagatat 360 

tccaaaaaga gatactacaa ataaagatgc ctttatttta ttatttctaa taaaaataga 420 

agcaataaaa aataataaca atgatataaa tctaatgttt ttaaatatat tgtcttttat 480 

gttagtaata gtcgttagta tgtttgattc tccatatatt acgtgtagtt ttttatatac 540 

atggaaataa ttttctttat actgagacat cacaccatca tcaaatggaa gtttgaagat 600 

ggtgcttggt ttgctaacca ataaaaagag tgcattcg 638 

<210> 433 
<211> 299 
<212> DNA 

<213> Escherichia coli 



<400> 433 

ctttacctgg catgatccac ttcgccagaa taccggcaat aagcccaaaa ataatccatg 60 

acagaatgcc cattgtttcc tcacttatct gttttgcatt agcgggttag tcgctgataa 120 

aaagcatagc acaacatcgg gagggcaaga tttgtgacga gcatcacgga ggtttttttg 180 

cgatggcgca gaaattgcgc catcaacgat cagtgataat taccaaccac aaacatcatg 240 

ttcgttttcc gtgtcataag aacgtacggt attcaccaga tcttttatca cttcagccg 299 

<210> 434 
<211> 388 
<212> DNA 

<213> Escherichia coli 



<400> 434 

gaaaaaggag gcaatatcgg gtaaaggcat tagcccgacg aatacgtcgg gctacaaata 60 

ttattgtgct gcaggtgttt tagcgggttg ttgatccaca ggttctaact ggaagaccac 120 

atcgacctga tcatcaaact gaatagcggc ctgctcgtaa gtttcctggg cggacaccgg 180 

cgcggcatcg gctttcatca tccgcaccat tgggctgggc tgatagttgg aaacatggta 240 

gcgcacgcta tataccggcc ccagtttacg atgaaagccg ttcgccagtt cctgcgcctg 300 

atgaatcgcg ttatcaatcg ctgccttacg cgctttgtct ttataggcat ccggctgcgc 360 

cacgcccagc gacacagaac gaattccc 388 

<210> 435 
<211> 351 
<212> DNA 

<213> Escherichia coli 



<400> 435 

ctatccttga tgaaaccgcg agcaaagata ggtgattacg tcatggtttt acagaaaatt 60 

acagaaaaag gaggcaatat cgggtaaagg cattagcccg acgaatacgt cgggctacaa 120 

atattattgt gctgcaggtg ttttagcggg ttgttgatcc acaggttcta actggaagac 180 

cacatcgacc tgatcatcaa actgaatagc ggcctgctcg taagtttcct gggcggacac 240 

cggcgcggca tcggctttca tcatccgcac cattgggctg ggctgatagt tggaaacatg 300 

gtagcgcacg ctatataccg gccccagttt acgatgaaag ccgttcgcca g 351 

<210> 436 
<211> 762 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> misc_f eature 
<222> (1) . . . (762) 
<223> n = A,T,C or G 



<400> 436 

aattatgaaa cactgtctgg aatcgtctga atgacgggca catttgcgag cacgcatcca 60 

gtaataacac aggaaactat tttatctacg cgttagcgat agactgcttg catggcgaaa 120 

ggaggtaagc cgacgatttc agcgggacgc tgaaacggga aagcccctcc cgaggaaggg 180 

gccataaata aggaaagggt catgatgaag ctactcatca tcgtggtgct cttagtcata 240 

agcttccccg cttactaaga ctaccagggc gggggaaacc ccgctctacc ctcactcctg 300 

aaagtatgcc ttcacgataa gattgtcaat ccgcaggctt tgtagtctgc gatcctgcca 360 

gcaaatattc tttgcgagtc gttacgcaat aatcacagag gaaactattt tattcacgcg 420 

ttagcgatag actgcattca gggcgaaagg aggtaagccg atgatttcag cgggacgctg 480 

aaacgggaaa gcctctcccg gagaagaggg cttttaataa ggaaagggtt atgatgaagc 540 

acgtcatcat actggtgata ctcttagtga ttagcttcca ggcttactaa gaacaccagg 600 

gggaggggga aacctcttcc taaccctcac ttctgaaatt gggtgctatg acgctggcgt 660 

tactgcttan cgctaccagt ttgtctgccc tggcggttgt aacgccagat cggtacccgt 720 

ttggatattt taatgaaagc cgacaaatca atcancgtga eg 762 

<210> 437 
<211> 292 
<212> DNA 

<213> Escherichia coli 



<400> 437 

cacatttgeg agcacgcatc cagtaataac acaggaaact attttatcta cgcgttagcg 60 

atagactget tgcatggcga aaggaggtaa gecgacgatt teagegggae getgaaaegg 120 

gaaagcccct cccgaggaag gggccataaa taaggaaagg gtcatgatga agctactcat 180 

catcgtggtg ctcttagtca taagcttccc cgcttactaa gactaccagg gcgggggaaa 240 

ccccgctcta ccctcactcc tgaaagtatg ccttcacgat aagattgtca at 292 

<210> 43B 
<211> 631 
<212> DNA 

<213> Escherichia coli 



<400> 438 

atttacacrt tttacgaaat catgggatca ctaacaaaat atcgcttgtc agttatattg 60 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 120 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 180 

cagccaccgc gggttcgacc aatacctegg gaatttctaa gtatgagtta agtagtttca 240 

ttgetgaett taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 300 

agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggaegcact 360 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac egaeggtaaa atcattaaag 420 

cctacgacgg tgagattttt tatcatcget aaaaaaagee ccctcatcat gagggggaaa 480 

tgcagacacc ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 540 

tatttcacgt tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcg 600 

tagaegtagt aactggctgt tateggaatt g " " 631 

<210> 439 
<211> 566 
<212> DNA 

<213> Escherichia coli 



<400> 439 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 60 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 120 

cagccaccgc gggttcgacc aatacctegg gaatttctaa gtatgagtta agtagtttca 180 

ttgetgaett taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 240 
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agtacaacat taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggacgcact 300 

ggacctatat gggtggcgcg tacgtgttga tcagcgacac cgacggtaaa atcattaaag 360 

cctacgacgg tgagattttt tatcatcgct aaaaaaagcc ccctcatcat gagggggaaa 420 

tgcagacacc ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 480 

tatttcacgt tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcg 540 

tagacgtagt aactggctgt atcgaa 566 

<210> 440 
<211> 339 
<212> DNA 

<213> Escherichia coli 
<400> 440 

cgtattcaca tccttttgat tggtgataac atgcgaatcg gtattatttt tccggttgta 60 
atcttcatta cagcggtcgt atttttagca tggtttttta ttggcggcta tgctgccccg 120 
ggagcataaa gatgaaaaaa acaacgatta ttatgatggg tgtggcgatt attgtcgtac 180 
tcggcactga gctgggatgg tggtaacgtc acctctaaaa aatagcaaag gctgcctgtg 240 
tgcagccttt gtgcaattta agcgttaact tttaatcttc ctgtagataa atagcacgac 300 
aatcgcacca ataacggcaa ccacgaagct gccaaaatt 339 

<210> 441 
<211> 376 
<212> DNA 

<213> Escherichia coli 
<400> 441 

catgaatatt taaaaaggaa aacgacatga aaccgaagca cagaatcaac attctccaat 60 
cataaaatat ttccgtggag cattttatta ttgaatatag aggtttaact ccggtaaaaa 120 
acaaagaagc attgaatgca gggaaaaata atatggccat aaaaaacatc gaaagaaact 180 
cttttaattt aacatgtaaa cgcatggtta atcctcatat cacgggtgga gtgttaagaa 240 
catacataaa tggagtcatg ttttcccttt tccatttatc aagttcctgt tgccgtttta 300 
gtccatctct aattgcatat tttaattttt ctgataaatg gcattgagca tcgatttcat 360 
ttaaaacaac tgtaca 37g 

<210> 442 
<211> 446 
<212> DNA 

<213> Escherichia coli 
<400> 442 

ttacgatagc tattagtaaa aatataagag ttagctgtat tgttatgtct gtggcgaaat 60 
tgactacctt cgtttttttg attaagaatg attttattat cgtaagtaaa attacatgaa 120 
tatttaaaaa ggaaaacgac atgaaaccga agcacagaat caacattctc caatcataaa 180 
atatttccgt ggagcatttt attattgaat atagaggttt aactccggta aaaaacaaag 240 
aagcattgaa tgcagggaaa aataatatgg ccataaaaaa catcgaaaga aactctttta 300 
atttaacatg taaacgcatg gttaatcctc atatcacggg tggagtgtta agaacataca 360 
taaatggagt catgttttcc cttttccatt tatcaagttc ctgttgccgt tttagtccat 420 
ctctaattgc atattttaat ttttct 445 

<210> 443 
<211> 388 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (1) . . . (388) 
<223> n = A, T,C or G 

<400> 443 

tcaccccggt gccgattttc aggcatcctg atttaactta gcacccgcaa cttaactaca 60 
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ggaaaacaaa 
cggactttta 
tcacaatagg 
tcgcatcttt 
ccgctttttc 
cncgcttttc 



gagataaatg 
cccgcctggt 
ctgtgtagcc 
atctttggct 
gcgctccaga 
ttcctgacga 



tctaatcctg 
ttattaattg 
tgggcctgtt 
gcaggttgaa 
cgagcctggc 
atagccgc 



<210> 444 
<211>.209 
<212> DNA 

<213> Escherichia coli 



atgcaaatcg 
cactgtnatc 
tctctttcac 
cggctgcgct 
gcgcttcgaa 



agccgatttt 
cgggcgttcg 
ccgcgccaga 
cttatgtcgt 
acgcgctttg 



ttaatcttta 
cccgctttaa 
gcggcagcaa 
tcaaggcgag 
gcttctgcgg 



120 
180 
240 
300 
360 
388 



<400> 444 

aattttaata acgctatctg cggataaagc agaataggtg gttaacccca gacataaacc 
gaggaaaata atgttattgt atttcataat ctattgttcc ttagcgacag attgctgtct 
gctggttcag taaggtacca ggagaaactt caggaagctt gtactcgaca atacagtttg 
agtttttatc tttgccccat gaaacctgt 

<210> 445 
<211> 341 
<212> DNA 

<213> Escherichia coli 



<400> 445 

catcctcaat accgttaaat gcaacccgaa cccccgttgt ccctttgctg cattcactta 60 

acgtaatctg aaaagggacg gctggacttg tgctaccggt cgttggaaat tgtctggcac 120 

tgtttttttg gagatctacg gtaaaattaa gcgaatccga tgagactgtg cagccataat 180 

cgaggacgcg cccgctaatt ttaataacgc tatctgcgga taaagcagaa taggtggtta 240 

accccagaca taaaccgagg aaaataatgt tattgtattt cataatctat tgttccttag 300 

cgacagattg ctgtctgctg gttcagtaag gtaccaggag a " 341 

<210> 446 
<211> 697 
<212> DNA 

<213> Escherichia coli 



<400> 446 

agatttactg ccaatttccg gcagatcgga aagggttaam ccatattgat ccataagggt 60 

acgaatcmcg ggctataccg ccaggcatgg cttgagccat ggcattaaat tccgcaaatt 120 

cgggcgctga ttcttcccac gcggttattt tggcacacac cagatccagc aaggggtttt 180 

caggatcgtt gagcagcaga tgatctacca gttccagcgc ctgggtgtat tgttcctcgt 24 0 

tctgaatacc cgccagaaaa ggtgccacag cagttagctt ttctcctgct tgcaagatgt 300 

cggcaatcgc aatcattttt tccccttagt acgatgaaca gcggtaaaga aatcgtattc 360 

tttatgcgtc ataacttcac gtatgtagca cttttgcgat tcaaaaaaga ccattgctac 420 

aacacgtaat tcattgcccc caacattgaa aacataatgc ttatccagat atttgaagtt 480 

atccagagat gggaatactg cttttaatga ctcaggtttt ttgaaatatc ccttagcaat 540 

cgtgktcccc agagccacca actccgtttt atgttgcggg tatttttccg cagcatcttt 600 

caatgctttt tgagttatca ggtgcattct tcatcacgtc cgtkgmcaaa ttggcaatat 660 

gataacatcc gttgccagat tggcacggat gaattat 697 

<210> 447 
<211> 215 
<212> DNA 

<213> Escherichia coli 



<400> 447 

aattaataac ttttcgttag gcagttttgg gtgtgagttg caagagggga gactactgaa 60 

taactcaagt tttataatcg aggggaaaat ggtgatggcg ttcatagcaa aacgccctca 120 

accataaagg tcgagggcgc ttaagatgtt aaaaacccgc tatccgttaa aaaacaatgt 180 

tcaactaagg tcagtgacat tgcgctaaaa aagcg 215 
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<210> 448 
<211> 395 
<212> DNA 

<213> Escherichia coli 



<400> 448 

gcattattca 

ttaaggttta 

gcattgctat 

taagggacag 

attgaatgtt 

aaattcatat 

atattgaaat 



tgagaaatgt 
attatctgtg 
aatattggtt 
gcatagagta 
gacgctatgt 
aatgcgcctt 
agtccaacta 



gtatcgtaaa 
tgtgatattt 
atcatttgct 
atgatacgta 
gtttatgagg 
tgctcatgaa 
cttctttatt 



tcaactgaaa 
tattgaatgt 
gaatggattc 
tgcataacca 
gagaggtatt 
tggatgccag 
accaa 



<210> 449 
<211> 641 
<212> DNA 

<213> Escherichia coli 



ttaacgcaac 
tttaaatatt 
agtcttaatg 
acatctttac 
ttcagttgat 
tatgtagtgg 



catttgttat 
gtttttattg 
agtgggtttt 
tcattatgtc 
ctggattgtt 
gaaattataa 



60 
120 
180 
240 
300 
360 
395 



<220> 

<221> misc_feature 
<222> (1) . . . (641) 
<223> n = A,T,C or G 



<400> 449 

ataatcaggt 

cgcgtggcaa 

atgatatctt 

ctatacttat 

attgattatt 

aaattaatta 

gcgctcctgt 

gnntgcncgn 

cgatngtcgg 

cgaaaggctn 

tcctgagnan 



aagaaaaggt 
tacatcagtg 
ccgatttatc 
tcagcactca 
ttcctgattg 
atgtcatcag 
catgtgcatt 
cgncccacct 
cgggngcctc 
attncaaaga 
gacaaatncc 



gcgcggagat 
gcaataaaac 
ttaatcgttt 
caaataaagg 
ggctactggt 
gtccgaaaat 
gcttcatata 
naccccatgc 
cccatgcnan 
ctgggccttn 
gccgggagcg 



<210> 450 
<211> 314 
<212> DNA 

<213> Escherichia coli 



taccgtgtgt 
gacatatcca 
atggataacg 
aacgccaatg 
ggtgactggc 
aacgagaata 
atcactggcg 
cgaacttcag 
agtangggaa 
cntttatctg 
gatttgaacn 



tgcgatatat 
gaaaaatata 
gcaaagggct 
aaaattatac 
gtatttaaga 
tttcagtctc 
caaggagcgc 
aantgaaaac 
ntgccangcg 
atgtttgtcg 
t 



tttttagttt 
cactaagtga 
tcgttttttc 
tctgggctgt 
tgatatttta 
tcatcctgtt 
cgcaggcgna 
nccntaacnc 
ncnnattaaa 
gagaacgctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
641 



<220> 

<221> misc_feature 
<222> (1) . . . (314) 
<223> n - A,T,C or G 



<400> 450 

gaactacgag taagaatagc tncgaattcc cgtttatgga taacggcaaa gggcttcgtt 60 

ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat tatactctgg 120 

gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt taagatgata 180 

ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatatttca gtctctcatc 240 

ctgttgcgct cctgtcatgt gcattgcttc atataatcac tggcgcaagg agcgcgcagg 300 

gggntntnnt cttt " 314 

<210> 451 
<211> 236 
<212> DNA 

<213> Escherichia coli 



<400> 451 
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atatacacta agtgaatgat atcttccgat ttatcttaat cgtttatgga taacggcaaa 
gggcttcgtt ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat 
tatactctgg gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt 
taagatgata ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatat 

<210> 452 
<211> 418 
<212> DNA 

<213> Escherichia coli 



<400> 452 

cggagattac cgtgtgttgc gatatatttt ttagtttcgc gtggcaatac atcagtggca 60 

ataaaacgac atatccagaa aaatatacac taagtgaatg atatcttccg atttatctta 120 

atcgtttatg gataacggca aagggcttcg ttttttccta tacttattca gcactcacaa 180 

ataaaggaac gccaatgaaa attatactct gggctgtatt gattattttc ctgattgggc 240 

tactggtggt gactggcgta tttaagatga tattttaaaa ttaattaatg tcatcaggtc 300 

cgaaaataac gagaatattt cagtctctca tcctgttgcg ctcctgtcat gtgcattgct 360 

tcatataatc actggcgcaa ggagcgcgca gggggcggcc aatcgccgcc gccccctg 418 

<210> 453 
<211> 551 
<212> DNA 

<213> Escherichia coli 



<400> 453 

aacaatttgc ccatgcgctc ggtcatgcgc tgcatcgccc ggccattttg sgcgtccccg 60 

cgaccgccat tcgactgtta atgggcgaat cttcagtact ggtattaggt ggacaacgcg 120 

cgctgcctaa acggctggaa gaagcgggtt ttgcgtttcg ctggtacgat ttagaagagg 180 

cgctggcgga tgtcgttcgc tgatgtggtt tacagcaaac atccgccagt taactcccgg 240 

tgttacagga ttagtggctt tgcgcgataa gatcgtctgg tgaaagtcgg gtcaccatca 300 

taactaactc tctgtctaaa cctctatcca gcatctcctg agcaatacgc agggcttctt 360 

cgtgtttgcc ctgcattgcg ccttcttcac gtaatctgtc agcaatggtc atcaagtttc 420 

tccttttctt gtggtgcgcg ttccgctatc tcaccaataa atgcacgaaa acgctgggca 480 

tcccctgttt gtaatacgta attaaacagg gcttttagct gtctgtcatt agtgktccct 540 

gtaactagca g " j 51 

<210> 454 
<211> 93 
<212> DNA 

<213> Escherichia coli 



<400> 454 

tggcatctcg gtgttgccga tcttcatgat atccagcccg ccggaaactt cttcccaaac 60 
ggttttgctg ttatccattg agtcacggaa ctg 93 

<210> 455 
<211> 232 
<212> DNA 

<213> Escherichia coli 



<400> 455 

cgtgccgaga tgatcctgta accatcatca gttgtgaagt agtgattcac gacttcaagg 60 

cgcttttcaa aagggtattt tggctttgac atattagggg ctattccatt tcatcgtcca 120 

acaaaatggg tgcagtacat actcgttgga aatcaacaca ggaggctggg aatgccgcag 180 

aaatatagat tactttcttt aatagtgatt tgtttcacgc ttttattttt ca 232 

<210> 456 
<211> 713 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> misc_f eature 
<222> (1) . . . (713) 
<223> n = A,T,C or G 

<400> 456 

ttagnggatn naangcccac ancctcgang gatctaggag gtagaatagc ttcgaattcc 60 

ccagcagagc gcggccttct tcgtcagatt tcgcagtagt ggtaatggta atatccaaac 120 

cacgaacgcg gtcgacttta tcgtagtcga tttctgggaa gatgatctgc tcacggacac 180 

ccatgctgta gttaccacga ccgtcgaaag acttagcgga caggccacgg aagtcacgga 240 

tacgaggtac agcaatagtg atcaggcgct caaagaactc ccacatgcgt tcgccacgca 300 

gagttacttt acagccgatc ggatagccct gacggatttt gaagcctgca acagatttgc 360 

gtgctttggt gatcagcggt ttttgaccgg agattgctgc caggtctgct gctgcgttat 420 

ccagcagttt tttgtcagcg atcgcttcac caacacccat gttcagggtg atcttctcga 480 

cccgagggac ttgcatgaca gaattgtagt taaactcagt catgagtttt ttaactactt 540 

cgtctttgta gtaatcatgc agtttcgcca tcgtactact ccatgtcggt gaacgctctc 600 

ctgagtagga caaatccgcc ggagccggat ttaacgttgc gaacaaccgn cccggagggg 660 

tggnggcagg accccgccat aactggcagc attaaattaa gcagaaggcc ate " 713 

<210> 457 
<211> 292 
<212> DNA 

<213> Escherichia coli 
<400> 457 

tgaacagcag agataeggee agtgcggcca atgttttttg tcctttaaac ataacagagt 60 
cctttaagga tatagaatag gggtatagct aegecagaat ategtatttg attattgeta 120 
gtttttagtt ttgcttaaaa atattgttag ttttattaaa tgcaaaacta aattattggt 180 
atcatgaatt tgttgtatga tgaataaaat ataggggggt atagatagac gtcattttca 240 
tagggttata aatgegacta ccatgaagtt tttaattgaa agtattgggt tg 292 

<210> 458 
<211> 282 
<212> DNA 

<213> Escherichia coli 
<400> 458 

ttattaaatg caaaactaaa ttattggtat catgaatttg ttgtatgatg aataaaatat 60 
aggggggtat agatagacgt cattttcata gggttataaa tgcgactacc atgaagtttt 120 
taattgaaag tattgggttg ctgataattt gagctgttct attcttttta aatatctata 180 
taggtctgtt aatggatttt atttttacaa ttttttgtgt ttaggcatat aaaaatcaac 240 
ccgccatatg aacggcgggt taaaatattt acaacttagc aa 282 

<210> 459 
<211> 300 
<212> DNA 

<213> Escherichia coli 
<400> 459 

tctgcgttcc gctaaaaggt geaaatgetc aggacgttgc agcgttttgc gtgaccgctc 60 
ggggaaggca aaattgeetc tgggaaagca ttgcgcgggg tccggcgctc atcaacaatc 120 
ggggggcagc aaggggctga aacgggaaag cccctcccga agaaggggee ttgtataagg 180 
aaagggttat gatgaagctc gtcatcatac tggttgtgtt gttactgtta agtttcccga 240 
cttactaaca actcatcaga ggggggagaa atcctccctt acccttgttc ctttactcta 300 

<210> 460 
<211> 293 
<212> DNA 

<213> Escherichia coli 
<400> 460 
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cggggtccgg cgctcatcaa caatcggggg gcagcaaggg gctgaaacgg gaaagcccct 60 

cccgaagaag gggccttgta taaggaaagg gttatgatga agctcgtcat catactggtt 120 

gtgttgttac tgttaagttt cccgacttac taacaactca tcagaggggg gagaaatcct 180 

cccttaccct tgttccttta ctctaggttg aaaaaacaac agcgtcaata ggcctgccat 240 

gtacgaagcg agatctgtga accgctttcc ggttagcctt ttttatcctg ttg 293 

<210> 461 
<211> 359 
<212> DNA 

<213> Escherichia coli 



<400> 461 

caacacagga ggctgggaat gccgcagaaa tatagattac tttctttaat agtgatttgt 60 

ttcacgcttt tatttttcac ctggatgata agagattcac tgtgtgaatt gcatattaaa 120 

caggagagtt atgagctggc ggcgttttta gcctgcaaat tgaaagagta agagtcttcg 180 

gcgggaaatt attcccgcct tacttacggc gttgcgcatt ctcattgcac ccaaatttat 240 

tcttcacaaa aataataata gattttatta cgcgatcgat tatttatttc ctgaaaacaa 300 

ataaaaaaat ccccgccaaa tggcagggat cttagattct gtgcttttaa gcagagatt 359 

<210> 462 
<211> 673 
<212> DNA 

<213> Escherichia coli 



<400> 462 

gcaacccatg tcctgacctg ggttcggggg acaccaaaac gtgccgagat gatcctgtaa 60 

ccatcatcag ttgtgaagta gtgattcacg acttcaaggc gcttttcaaa agggtatttt 120 

ggctttgaca tattaggggc tattccattt catcgtccaa caaaatgggt gcagtacata 180 

ctcgttggaa atcaacacag gaggctggga atgccgcaga aatatagatt actttcttta 240 

atagtgattt gtttcacgct tttatttttc acctggatga taagagattc actgtgtgaa 300 

ttgcatatta aacaggagag ttatgagctg gcggcgtttt tagcctgcaa attgaaagag 360 

taagagtctt cggcgggaaa ttattcccgc cttacttacg gcgttgcgca ttctcattgc 420 

acccaaattt attcttcaca aaaataataa tagattttat tacgcgatcg attatttatt 480 

tcctgaaaac aaataaaaaa atccccgcca aatggcaggg atcttagatt ctgtgctttt 540 

aagcagagaa tacaggctgg ttacgttacc agctgccggg cctttagcgc cgctttcgat 600 

ggtgaaggac actttctgac cttcgtccag agatttgtaa ccatcgttct ggatagcaga 660 

gaagtgtacg aac " 673 

<210> 463 
<211> 630 
<212> DNA 

<213> Escherichia coli 



<400> 463 

tggtggcatt ggttgctgga gagagaaaac ccccgcacgt tgcaggtatg cacctgacaa 60 

caccacgggg gctaatcttg actctagacc actcaagaat agccgcgaaa cgttgtcatt 120 

acaacacagg cggctatatg acgttcgcag agctgggcat ggccttctgg catgatttag 180 

cggctccggt cattgctggc attcttgcca gtatgatcgt gaactggctg aacaagcgga 240 

agtaacgtgt catgcgggcg tcaggctgcc gtaatggcaa tttgcgcccg gaccaggccg 300 

caggggggaa actctgcggc ctttttcgtt cttactgcgg gtaaggcacc cagtcgccgc 360 

cgttcaggcg aacgtacggt ttatcctggt attgaataac tactgcattt gagttctcgg 420 

agaccggtgc tgtttgtggc aacccactgg tgagtttttt ccagtcaaca ttgtcttcgg 480 

tgaaaatctt gccatcgaga acgcgaacca ccagatcgga gatagccagg aagctgctcg 540 

gttgttcgat gacaatcggt gccccctgat gcggtgcctt catgccgaag aatttcaccc 600 

caacggggac gtcggtgata gacgggctag 630 

<210> 464 
<211> 391 
<212> DNA 

<213> Escherichia coli 
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<400> 464 

ctcaggctgc tgattgtttt tttgtgcaat ggcgcggtat tagcgtcgtt gctgtcgatg 60 

gagagaatca taaacgtggt gaatgatgat tgttagcaag gaaaactgtc aaaaatcttc 120 

aaaaaatttg agggataagg ccggaatggc tccggccaga gggaagttaa ccgcgaagct 180 

gttgctgctt gagggtcgtt ttaaccagac gccaggcgct ccatacgcca aaaccgcgtc 240 

tggcccagcg gaccagcata ttaggatggc gaatcgtcca gatcgccatc acgctactgc 300 

caaccagcgc ccaggagcgc agacttagca gcatattcca gcgacgatcg taagcgcctg 360 

ttgtctccag ccattcacga cgactggcgg a ~* ^91 

<210> 465 
<211> 625 
<212> DNA 

<213> Escherichia coli 



<400> 465 

aacacaccac accataaacg gaggcaaata atgctgggta atatgaatgt tttaatggcc 60 

gtactgggaa taattttatt ttctggtttt ctggccgcgt atttcagcca caaatgggat 120 

gactaatgaa cggagataat ccctcaccta accggcccct tgttacagtt gtgtacaagg 180 

ggcctgattt ttatgacggc gaaaaaaaac cgccagtaaa ccggcggtga atgcttgcat 240 

ggatagattt gtgttttgct tttacgctaa caggcatttt cctgcactga taacgaatcg 300 

ttgacacagt agcatcagtt ttctcaatga atgttaaacg gagcttaaac tcggttaatc 360 

acattttgtt cgtcaataaa catgcagcga tttcttccgg tttgcttacc ctcatacatt 420 

gcccggtccg ctcttccaat gaccacatcc agaggctctt caggaaatgc gcgactcaca 480 

cctgctgtca cggtaatgtt gatatgccct tcagaatgtg tgatggcatg gttatcgact 540 

aactggcaaa ttctgacacc tgcacgacat gcttcttcat cattagccgc tttgacaata 600 

atgataaatt cttcgccccc gtagc g25 

<210> 466 
<211> 623 
<212> DNA 

<213> Escherichia coli 



<400> 466 

tgcttttgaa tatgtgctcg caatcttgag aaggaaatgg cgaccacgaa agaaaaggca 60 

aaaacgataa tctgaaagag ccaaggtatt tcagtataag cattgaatgc gacagtaaac 120 

tctttcggta tcagccagag agtgagacca aaaatgataa tcgtatacat aagtctttcg 180 

agtggctcgt tagcaaaaag tttcaacaat ggagtaaata catccaacat atcaataact 240 

ctcaactgta agggtattga aatgttaaca caagctctcg ctgtaggggt atagccgaga 300 

ccaccgaagc ccggaggtgg tgaaataaaa ccgggcacaa cacgaaggcg catttccgat 360 

atccataaag agtcggtctt gtctgttaaa tttaaatggt gggagtgcgc ctccggttgt 420 

aaataacgac attgctgtgt gtagtcctgg cggcatcagt ttttttcttg aagttcggct 480 

gatgtccgcc ctttttaaag tgaattttgt gatgcggtga atgcggctaa gcgcacgtgg 540 

cacagttaaa agtcatgtta gtccttattg gtttgggtgg gaaagccgac tgtaattgtt 600 

aactggttgc agtcacctgg agg " 62 3 

<210> 467 
<211> 234 
<212> DNA 

<213> Escherichia coli 



<400> 467 

tgtttactta caagagattc atctttgtat aaataaagat aagtaattac gcataaaaca 60 

acaatgatta taatagcaaa aataaatatt atcatctttg atagattact tgagatagcc 120 

agcatcttgt aaagccttta tcgttttttt atgctctgga ttaatataat cactacatct 180 

atctgagcaa tctgttgttg atggacatgt caacccatgg tcatttacag ccaa 234 

<210> 468 
<211> 529 
<212> DNA 

<213> Escherichia coli 
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<400> 468 

attagctatt 

tagttccgtt 

gctggtaatc 

atgaacaact 

tattttctgt 

ctccaactta 

ccaccgattt 

tcaggttatg 

agtggttttg 



tcggctaaaa tagagactac atgtcttcgg tccatctcac ttaaggagtg 
gtaagttttt ccatagcttg cactgctaaa tttcgaacaa ggaattttct 
tctaaaaaga tggcatggtt tacaatgatt tttgtttcct tttgattatt 
gtccatgatt tcgtttaaga atgaagagaa atcactaaac gaactgaata 
gccaatatta tctctaattt caaaaaagtt acttttaatg tcggtaatga 
ttgatagtgt tttatgttca gataatgccc gatgactttg tcatgcagct 
tgagaacgac agcgacttcc gtcccagccg tgccaggtgc tgcctcagat 
ccgctcaatt cgctgcgtat atcgcttttc cttatcagtt cgttgatgtc 
accacgaggg agcttcacgc gagttattga aaaccctga 



60 
120 
180 
240 
300 
360 
420 
480 
529 



<210> 469 
<211> 261 
<212> DNA 

<213> Escherichia coli 
<400> 469 

caaagaacct tcaacatgaa aaatatccat ttgtttgcaa aaaaagatta ttaggaagga 60 
aattaatgca attatcgaaa attcaaaaaa tatccaaaaa tagtatactt tattccagaa 120 
gagttcaata taatgtttgt cttcaatttt tcttacttca gggtaatata gattgctcat 180 
tacattgtga gcttcatctt tatttaattt tctgttgact ccagctctcc gtgataacgg 240 
ttttataatt ' agatgcttat c ~ ^ 261 

<210> 470 
<211> 98 
<212> DNA 

<213> Escherichia coli 
<400> 470 

agatgattgc cgggaacttg ttagcggcac gcaggcggcg gctcgcaccc ttaccctgct 60 
ctttacgtac ttctgcgttg atagtaaaca tttctttc 99 

<210> 471 
<211> 259 
<212> DNA 

<213> Escherichia coli 
<400> 471 

agcgcgaacg aagtcgatgt gctgcagctt cggtttgtac gggtgacgct gtacgtcctg 60 
agctttaact ttgatttctt taccgtcaac aacgatggtc agaacttcgc tgtagaattc 120 
agctttagct tgcatgttca tgactttgtc gtgatccagc tcgatagcca gcggcgcttc 180 
tttgccaccg tagatgattg ccgggaactt gttagcggca cgcaggcggc ggctcgcacc 240 
cttaccctgc tctttacgt 259 

<210> 472 
<211> 94 
<212> DNA 

<213> Escherichia coli 
<400> 472 

aaaaacggcg taaagaaagg atgcaaacat gttaataaaa actcaaattg atcccacgta 60 
tatattacgc cgcaaaatcc ttacaataaa cagg 94 

<210> 473 
<211> 174 
<212> DNA 

<213> Escherichia coli 
<400> 473 

ttaattatta aaatagtgta acgcgattat gtggttatgg gggtaaacat taaataaacc 60 
agcggggagg ggaggtaaag tgaaaaaata aaaagcggat aatcttaata agcaggccgg 120 
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acagcatcgc catccggcac tgatacgagg tttatttcag ctcatcaacc atcg 174 

<210> 474 
<211> 138 
<212> DNA 

<213> Escherichia coli 
<400> 474 

ctgtaaaaac gtcaaaaaga gtgttttatc aacagaagaa tggaggtctg acagatagta 60 
gtaatgcaaa aaaatggaga cttaagttga atgaacggga gtaaagcgaa aagactatag 120 
agtgaaggag aaattccc " ^38 

<210> 475 
<211> 191 
<212> DNA 

<213> Escherichia coli 
<400> 475 

tttgttggct taatattcta ttgttatctt tatttataga tgtttatatt gcatgaggtg 60 
gtttttggag agaagaatga ggaagatgcg tcgagccaca gaaacgttag ctttacatat 120 
agcggaggtg atgtgaaatt aatttacaat agaaataatt tacatatcaa acagttagat 180 
gctttttgtc g ~ igi 

<210> 476 
<211> 245 
<212> DNA 

<213> Escherichia coli 
<400> 476 

cggccattta tacaggaaaa gcctatgtca gaacgtaaaa actcaaaatc acgccgtaat 60 
tatctcgtta aatgttcctg cccaaactgc acccaagagt cagaacacag tttttcaaga 120 
gtacaaaaag gtgccctttt gatctgccct cattgcaaca aagtattcca gacaaatctt 180 
aaagctgtag cctgattgat tttattagta acaagtattt tttatatttt aataatatat 240 
ttaaa 245 

<210> 477 
<211> 319 
<212> DNA 

<213> Escherichia coli 
<400> 477 

aaattttcag gtaccttgtc accatacttt tttttctgag cattaatgat attttgagct 60 
tcttgaggat ctttaactcc ccacatttgg tggaaagtat tcatattaaa aggaaggttg 120 
aataatttgt ctttataaat cgccagtgga gaattagtaa aacgattaaa ttctactaaa 180 
tcattaacgt aatcccatat atatttatca ttggtatgaa aaatatgtgc accatattta 240 
tgaatctgga taccctcaca gtcctctgtg tacgcatttc caccgatatg atttcttttc 300 
tcaatcacta aaacttttt ^19 

<210> 478 
<211> 149 
<212> DNA 

<213> Escherichia coli 
<400> 478 

gcagtgatcg aagcgatgac gaagtgtatg gaaaaatcag aaaaactcag caaatcctga 60 
tgactttcgc cggacgtcag gccgccactt cggtgcggtt acgtccggct ttctttgctt 120 
tgtaaagcgc caaatctgcc gatttcaac I49 

<210> 479 
<211> 330 
<212> DNA 
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<213> Escherichia coli 



<400> 479 

gaaagtatct tcgttattga catcactgga aaatataact tgcttttcat tattaaactc 60 

gaagcgcgta ccgtatctgg acaaacattt atcgagctta ccaaattcct gaagaggttt 120 

aactacagat aacatttgcg cgtcctttgc agtaatgccc gtcaaatcct tgacgggcat 180 

tatttagatt aaattaccag tatttcttcg gagtgaagaa tattaccagg tatatttaac 240 

acccacgttc gcggaccagt cttgatctac gtcaccacca ccgaggtagt tagcatcggt 300 

ataggcgctg aagttcttgg tgaagctaaa J 330 

<210> 480 
<211> 191 
<212> DNA 

<213> Escherichia coli 
<400> 480 

tttttttcca gcaacggagc aaaaggtttg cccttgtgca gctcagggtt aaccacttta 60 
actacgtggc gacgacccgg agatgtcggt ttacatttaa caactgccat tgtattactc 120 
ctccgactta ctcagcgccg ccaacgaagt ccagattctg gccttctttc agggtgacgt 180 
aagctttttt c " " 191 

<210> 481 
<211> 188 
<212> DNA 

<213> Escherichia coli 
<400> 481 

tccctttaac taccagggtg ttaacgactt cgacttcgac ttcaaacagt ttctgcacag 60 
cagctttgat ttctgctttg gtcgcgtctt tagcaacttt gagtacgatg gtgttggatt 120 
tttccatcgc agtagacgct ttttcagaaa cgtgcggtgc acgcagcacc ttcagcagac 180 
gttcttca 188 

<210> 482 
<211> 172 
<212> DNA 

<213> Escherichia coli 
<400> 482 

caaaggcgaa caaagcctgt gaagcccgaa ggctccacag acagtgctac ttgaaggcct 60 
tactgtttct tcttaggagc gagcaccatg atcatctggc ggccttcgat cttcgttggg 120 
aaggattcga ccactgccag ttcttgcaaa tcgtctttca cgcgattaag ca 172 

<210> 483 
<211> 266 
<212> DNA 

<213> Escherichia coli 
<400> 483 

tggagaaaac gggtgattga taaagcaatc atcgttctag gggcgttaat tgcgctgctg 60 
gaactgatcc gctttctgct tcagcttctg aactgatagc ggaaacgtaa ttaagggcta 120 
agagcacact actcttagcc ctttaacatt taacgcattg tcacgaactc ttctgccgcc 180 
gttgggtgaa tggcgacggt attgtcgaag tcttttttgg ttgcccccat cttcagcgcc 240 
accgcgaagc cctgcaacat ttcgtc 266 

<210> 484 
<211> 259 
<212> DNA 

<213> Escherichia coli 
<400> 484 

cgcaggcagc tgatggtcaa caggatgaga gaaacccaga gacaggttaa tcacattgcc 60 
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tttaaccgct gcacggtaac ctacaccaac cagctgcagc ttcttagtga agccttcggt 

aacaccgata accattgagt tcagcagggc acgcgcggta ccagcctgtg cccaaccgtc 

tgcgtaacca tcacgcggac cgaaggtcag ggtattatct gcatgtttaa cttcaacagc 
atcgttgaga gtacgagtc 



120 
180 
240 
259 



<210> 485 
<211> 73 
<212> DNA 

<213> Escherichia coli 
<400> 485 

caggtcggaa cttacccgac aaggaatttc gctaccttag gaccgttata gttacggccg 60 
ccgtttaccg ggg 73 
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because they relate to parts of the international Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1. 



□ As all required additional search fees were timely paid by the applicant, this international Search Report covers all 
searchable claims. 



2. Q As all searchable claims could be searched without effort justifying an additional fee. this Authority did not invite payment 
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3. | y I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
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1-10, 12, 14-33, 46, 49, 52, 54, 56, 64, 66, 94 and 106, all partially 



4. [^J No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
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matter for which protection is sought, the present application fails to 
comply with the clarity and conciseness requirements of Article 6 PCT 
(see also Rule 6.1(a) PCT) to such an extent that a meaningful search is 
impossible. Consequently, the search has been carried out for those parts 
of the application which do appear to be clear (and concise), namely the 
nucleic acid sequences as identified in claims 1 and 8 respectively, 
sequences related to said sequences as well as their use. This 
corresponds to the subject-matter of claims 
1-10,12,14-33,46,49,52,54,56,64,66,94 and 106. 

It should be noted that since claim 46 has been searched, the 
subject-matter of claims 35-45 has been searched restricted to the gene 
products of claim 46, i.e. for those gene products for which (additional) 
search fees have been paid 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-7, 12, 49, 52, 56, 66, all partially 

Invention 1: 

A purified or isolated nucleic acid sequence consisting of 
SEQ.ID. No.: 405, a vector comprising said sequence, a host 
comprising said vector, the use of said sequence for 
inhibiting cellular proliferation, a composition comprising 
said sequence, the use of said sequence for inhibiting the 
expression of a gene and the use of said nucleic acid 
sequence for identifying bacterial strains. 



2. Claims: 1-7, 12, 49, 52, 56, 66, all partially 
Inventions 2 to 81: 

Idem as invention 1, but for SEQ.ID. NO' s 406-485 respectively 



3. Claims: Claims 8-10,12,14-33,46,54,64,66,94 and 106, 
all partially: 



Invention 82: 

A purified or isolated nucleic acid consisting of 
SEQ.ID. No.: 82, a vector comprising said nucleic acid 
sequence, a host comprising said vector, a polypeptide 
encoded by said nucleic acid sequence and having the 
sequence of SEQ.ID. No.: 243, an antibody binding said 
polypeptide, a method for producing said polypeptide, a 
method for identifying compounds influencing the activity of 
said polypeptide, a method for identifying compounds 
influencing the level of said polypeptide, a method for 
inhibiting the expresion of said nucleic acid, the use of 
said nucleic acid sequence for identifying bacterial strains 
and the use of said nucleic acid sequence for identifying 
proliferation inhibitors. 



4. Claims: Claims 8-10,12,14-33,46,54,64,66,94 and 106, 
all partially: 



Inventions 83 to 242: 

Idem as invention 82, but for SEQ.ID. No's 83-88, 90-242 (and 
their corresponding polypeptide sequences, see Table II) 
respectively. 
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